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(57) Abstract 

A specific region of chromosome 10 (10q23.3) has been implicated by series of studies to contain a tumor suppressor gene involved 
in gliomas, as well as a number of other human cancers. One gene within this region was identified, and the corresponding coding region 
of the gene represents a novel 47 kD protein. A domain of this product has an exact match to the conserved catalytic domain of protein 
tyrosine phosphatases, indicating a possible functional role in phosphorylation events. Sequence analyses demonstrated that a number of 
exons of the gene were deleted in tumor cell lines used to define the 10q23.3 region, leading to the classification of this gene as a tumor 
suppressor. Further analyses have demonstrated the presence of a number of mutations in the gene in both glioma and prostate carcinoma 
cells. Methods for diagnosing and treating cancers related to this tumor suppressor, designated as TS10q23.3, also are disclosed. 
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A TUMOR SUPPRESSOR DESIGNATED TS10Q23.3 

BACKGROUND OF THE INVENTION 
L Field of the Invention 

The present invention relates to the fields of oncology, genetics and molecular biology. 
More particular the invention relates to the identification, on human chromosome 10, of a tumor 
suppressor gene. Defects in this gene are associated with the development of cancers, such as 
gliomas. 

II. Related Art 

Oncogenesis was described by Foulds (1958) as a multistep biological process, which is 
presently known to occur by the accumulation of genetic damage. On a molecular level, the 
multistep process of tumorigenesis involves the disruption of both positive and negative 
regulatory effectors (Weinberg, 1989). The molecular basis for human colon carcinomas has 
been postulated, by Vogelstein and coworkers (1990), to involve a number of oncogenes, tumor 
suppressor genes and repair genes. Similarly, defects leading to the development of 
retinoblastoma have been linked to another tumor suppressor gene (Lee et ai, 1987). Still other 
oncogenes and tumor suppressors have been identified in a variety of other malignancies. 
Unfortunately, there remains an inadequate number of treatable cancers, and the effects of cancer 
are catastrophic — over half a million deaths per year in the United States alone- 
One example of the devastating nature of cancer involves tumors arising from cells of the 
astrocytic lineage are the most common primary tumors of the central nervous system (Russell & 
Rubinstein, 1989). The majority of these tumors occur in the adult population. Primary brain 
tumors also account for the most common solid cancer in the pediatric patient population and 
the second leading cause of cancer deaths in children younger than 15 years of age. An 
estimated 18,500 new cases of primary brain tumors were diagnosed in 1994 (Boring et al. t 
1994). Epidemiological studies show that the incidence of brain tumors is increasing and 
represents the third most common cause of cancer death among 18 to 35 year old patients. Due 
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to their location within the brain and the typical infiltration of tumor cells into the surrounding 
tissue, successful therapeutic intervention for primary brain tumors often is limited. 
Unfortunately, about two-thirds of these afflicted individuals will succumb to the disease within 
two years. The most common intracranial tumors in adults arise from cells of the glial lineage 
and occur at an approximately frequency of 48% glioblastoma multiforme (GBM), 21% 
astrocytomas (A) (anaplastic (AA) and low grade) and 9% ependymomas and 
oligodendrogliomas (Levin et al t 1993). 

Genetic studies have implicated several genes, and their corresponding protein products, 
in the oncogenesis of primary brain tumors. Among the various reported alterations are: 
amplification of epidermal growth factor receptor and one of its ligands, transforming growth 
factor-alpha, N-myc; gli, altered splicing and expression of fibroblast growth factor receptors, 
and loss of function of p53, pl6, Rb, neurofibromatosis genes 1 and 2, DCC, and putative tumor 
suppressor genes on chromosomes 4, 10, 17 (non- P 53), 19, 22, and X (Wong et al, 1987; El- 
Azouzi et al, 1989; Nishi et al, 1991; James et al, 1988; Kamb et al, 1994; Henson et al, 
1994; Yamaguchi et al, 1994; Bianchi et al, 1994; Ransom et al, 1992; Rasheed et al, 1992; 
Scheck and Coons, 1993; Von Demling et al, 1994; Rubio et al, 1994; Ritland et al, 1995). 

The most frequent alterations include amplification of EGF-receptor (-40%), loss of 
function of p53 (-50%), P 16 (-50%), Rb (-30%) and deletions on chromosome 10 (>90%). 
Furthermore, the grade or degree of histological malignancy of astrocytic tumors correlates with 
increased accumulation of genetic damage similar to colon carcinoma. Moreover, some changes 
appear to be relatively lineage- or grade-specific. For instance, losses to chromosome 19q occur 
predominantly in oligodendrogliomas, while deletions to chromosome 10 and amplification and 
mutation of the EGF-receptor occur mainly in GBMs. The deletion of an entire copy or 
segments of chromosome 10 is strongly indicated as the most common genetic event associated 
with the most common form of primary brain tumors, GBMs. 

Cytogenetic and later allelic deletion studies on GBMs clearly have demonstrated 
frequent and extensive molecular genetic alterations associated with chromosome 10 (Bigner et 
al, 1988; Ransom et al, 1992; Rasheed et al, 1992; James et al, 1988: Fujimoto et al, 1989 
Fults et al, 1990, 1993; Karlbom et al, 1993; Rasheed et al, 1995; Sonoda et al, 1996; 
Albarosa et al, 1996). Cytogenetic analyses have clearly shown the alteration of chromosome 
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10 as a common occurrence in GBMs, with 60% of tumors exhibiting alteration. Allelic deletion 
studies of GBMs have also revealed very frequent allelic imbalances associated with 
chromosome 10 (90%). However, the losses are so extensive and frequent that no chromosomal 
sublocalization of a consistent loss could be unequivocally defined by these analyses. 

5 Several recent studies have implicated the region 10q25-26, specifically a 17 cM region 

from D10S190 to D10S216. A telomeric region from D10S587 to D10S216 was implicated by 
allelic deletion analysis using a series of low and high grade gliomas that exhibited only a partial 
loss of chromosome 10 (Rasheed et aL, 1995). This region (~lcM) was lost or noninformative 
in 11 GBM's, 4 AA's, 1 A and 1 oligodendroglioma, suggesting localization of a candidate 

10 region. This study also illustrated that deletions to chromosome 10 occur in lower grade 
gliomas. Albarosa et aL (1996) suggest a centromeric candidate region based on a small allelic 
deletion in a pediatric brain tumor from the makers D10S221 to D10S209. Saya et aL, using a 
series of GBMs, have suggested two common regions of deletions, 10q26 and 10q24 
(D10S192). 

15 The short arm of chromosome 10 also has been implicated to contain another tumor 

suppressor gene. Studies first provided functional evidence of a tumor suppressor gene on lOp 
in glioma (Steck et aL, 1995) which was later shown for prostate (Sanchez et aL, 1995; 
Murakami et aL, 1996). The latter study has implicated a 1 1 cM region between D10S1 172 and 
D10S527. Allelic deletion studies of gliomas have shown extensive deletion on lOp, but again, 

20 no firm localization has been achieved (Karlbom et aL, 1993; Kimmelman et aL, 1996; these 
regions of chromosome 10 are shown to FIG. 1, below). Furthermore, the amplification of 
EGF-receptor has also been shown to occur almost exclusively in tumors that had deletions in 
chromosome 10, suggesting a possible link between these genetic alterations (Von Deimling et 
aL, 1992). 

25 Deletions on the long arm, particularly 10q24, also have been reported for prostate, 

renal, uterine, small-cell lung, endometrial carcinomas, meningioma and acute T-cell leukemias 
(Carter et aL, 1990; Morita et al, 1991; Herbst et aL, 1984; Jones et aL, 1994; Rempel et aL, 
1993; Peiffen et aL, 1995; Petersen et aL, 1997). Recently, detailed studies on prostate 
carcinoma, have shown that (1) the short and long arm of chromosome 10 strongly appear to 

30 contain tumor suppressor genes, and (2) the localization of the long arm suppressor gene maps 
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to the 10q23-24 boundary (Gray et al, 1995; Ittmann, 1996, Trybus et al, 1996). The region of 
common deletion identified by these three groups is centered around D10S215 and extends 
about 10 cM (FIG. 1). The region overlaps with our candidate region, however, no further 
localization within the region was reported fro prostate carcinoma. The allelic losses associated 
with prostate carcinoma also appear to occur in only about 30-40% of the tumors examined. 
Furthermore, deletions are observed in more advance staged tumors, similar to GBMs, and may 
be related to metastatic ability (Nihei et al, 1995; Komiya et al, 1996). The combination of 
these results suggest that multiple human cancers implicate the region 1 0q23-24. 

Differences in locations of the candidate regions suggest several possibilities. First, the 
presence of two or more tumor suppressor genes on lOq are possible. Second, not all deletions 
may effect the tumor suppressor gene locus. These alternatives are not mutually exclusive. In 
support of the latter possibility, a potential latent centromere was suggested to occur at 10q25 
which may give rise to genetic alterations, particularly breakage (Vouillaire et al, 1993). 

Despite all of this information, the identity of the gene (or genes) involved with the 
15 10q23-24-related tumor suppression remained elusive. Without identification of a specific gene 
and deduction of the protein for which it codes, it is impossible to begin developing an effective 
therapy targeting this product. Thus, it is an important goal to isolate the tumor suppressors) 
located in this region and determine its structure and function. 

SUMMARY OF THE INVENTION 

20 Therefore, it is an objective of the present invention to provide a tumor suppressor, 

designated as TSl0q23.3. Also an objective to provide DNAs representing all or part of a gene 
encoding TS10q23.3. It also is an objective to provides methods for using these compositions. 

In accordance with the foregoing objectives, there is provided, in one embodiment, a 
tumor suppressor designated as TS10q23.3. The polypeptide has, in one example, the amino 
25 acid sequence as set forth in FIG. 7 or FIG. 9. Also provided is an isolated peptide having 
between about 10 and about 50 consecutive residues of a tumor suppressor designated as 
TS10q23.3. The peptide may be conjugated to a carrier molecule, for example, KLH or BSA. 

In another embodiment, there is provided a monoclonal antibody that binds 
immunologically to a tumor suppressor designated as TS10q23.3. The antibody may be non- 
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cross reactive with other human polypeptides, or it may bind to non-human TS10q23.3, but not 
to human TS10q23.3. The antibody may further comprise a detectable label, such as a 
fluorescent label, a chemiluminescent label, a radiolabel or an enzyme. Also encompassed are 
hybridoma cells and cell lines producing such antibodies. 

In another embodiment, there is included a polyclonal antisera, antibodies of which bind 
immunologically to a tumor suppressor designated as TS10q23.3. The antisera may be derived 
from any animal, but preferably is from other than human, mouse or dog. 

In still another embodiment, there is provided an isolated nucleic acid comprising a 
region, or the complement thereof, encoding a tumor suppressor designated TS10q23.3 or an 
allelic variant or mutant thereof. The tumor suppressor coding region may be derived from any 
mammal but, in particular embodiments, is selected from murine, canine and human sequences. 
Mutations include deletion mutants, insertion mutants, frameshift mutants, nonsense mutants, 
missense mutants or splice mutants. In a particular embodiment, the tumor suppressor has the 
amino acid sequence of FIG. 9. The nucleic acid may be genomic DNA, complementary DNA 
or RNA. 

In additional embodiments, the nucleic acid comprises a complementary DNA and further 
comprises a promoter operably linked to the region, or the complement thereof, encoding the 
tumor suppressor. Additional elements include a polyadenylation signal and an origin of 
replication. 

Viral vectors such as retrovirus, adenovirus, herpesvirus, vaccinia virus and adeno- 
associated virus also may be employed. The vector may be "naked" or packaged in a virus 
particle. Alternatively, the nucleic acid may comprise an expression vector packaged in a 
liposome. 

Various sizes of nucleic acids are contemplated, but are not limiting: about 1212 bases, 
about 1500 bases, about 2000 bases, about 3500 bases, about 5000 bases, about 10,000 bases, 
about 15,000 bases, about 20,000 bases, about 25,000 bases, about 30,000 bases, about 35,000 
bases, about 40,000 bases, about 45,000 bases, about 50,000 bases, about 75,000 bases and 
about 100,000 bases. 
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In still yet another embodiment, there is provided an isolated oligonucleotide of between 
about 10 and about 50 consecutive bases of a nucleic acid, or complementary thereto, encoding 
a tumor suppressor designated as TS10q23.3. The oligonucleotide may be about 15 bases in 
length, about 17 bases in length, about 20 bases in length, about 25 bases in length or about 50 
bases in length. 

In still yet another embodiment, there is provided a method of diagnosing a cancer 
comprising the steps of (i) obtaining a sample from a subject; and (ii) determining the expression 
a functional TSl0q23.3 tumor suppressor in cells of the sample. The cancer may be brain, lung, 
liver, spleen, kidney, lymph node, small intestine, pancreas, blood cells, colon, stomach, breast,' 
endometrium, prostate, testicle, ovary, skin, head and neck, esophagus, bone marrow and blood 
cancer. In preferred embodiments, the cancer is prostate cancer or breast cancer. In another 
preferred embodiment, cancer is a brain cancer, for example, a glioma. The sample is a tissue or 
fluid sample. 

In one format, the method involves assaying for a nucleic acid from the sample. The 
method may further comprise subjecting the sample to conditions suitable to amplify the nucleic 
acid. Alternatively, the method may comprises contacting the sample with an antibody that binds 
immunologically to a TS10q23.3, for example, in an ELISA. The comparison, regardless of 
format, may include comparing the expression of TS10q23.3 with the expression of TSl0q23.3 
in non-cancer samples. The comparison may look at levels of TS10q23.3 expression. 
Alternatively, the comparison may involve evaluating the structure of the TS10q23.3 gene, 
protein or transcript. Such formats may include sequencing, wild-type oligonucleotide 
hybridization, mutant oligonucleotide hybridization, SSCP, PCR and RNase protection. 
Particular embodiments include evaluating wild-type or mutant oligonucleotide hybridization 
where the oligonucleotide is configured in an array on a chip or wafer. 

In another embodiment, there is provided a method for altering the phenotype of a tumor 
cell comprising the step of contacting the cell with a tumor suppressor designated TSl0q23.3 
under conditions permitting the uptake of the tumor suppressor by the tumor cell. The tumor 
cell may be derived from a tissue such as brain, lung, liver, spleen, kidney, lymph node, small 
intestine, blood cells, pancreas, colon, stomach, breast, endometrium, prostate, testicle, ovary, 
skin, head and neck, esophagus, bone marrow and blood tissue. The phenotype may be selected 
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from proliferation, migration, contact inhibition, soft agar growth or cell cycling. The tumor 
suppressor may be encapsulated in a liposome or free. 

In another embodiment, there is provided a method for altering the phenotype of a tumor 
cell comprising the step of contacting the cell with a nucleic acid (i) encoding a tumor suppressor 
designated TS10q23.3 and (ii) a promoter active in the tumor cell, wherein the promoter is 
operably linked to the region encoding the tumor suppressor, under conditions permitting the 
uptake of the nucleic acid by the tumor cell. The phenotype may be proliferation, migration, 
contact inhibition, soft agar growth or cell cycling. The nucleic acid may be encapsulated in a 
liposome. If the nucleic acid is a viral vector such as retrovirus, adenovirus, adeno-associated 
virus, vaccinia virus and herpesvirus, it may be encapsulated in a viral particle. 

In a further embodiment, there is provided a method for treating cancer comprising the 
step of contacting a tumor cell within a subject with a tumor suppressor designated TS10q23.3 
under conditions permitting the uptake of the tumor suppressor by the tumor cell. The method 
may involve a human subject. 

In still a further embodiment, there is provided a method for treating cancer comprising 
the step of contacting a tumor cell within a subject with a nucleic acid (i) encoding a tumor 
suppressor designated TS10q23.3 and (ii) a promoter active in the tumor cell, wherein the 
promoter is operably linked to the region encoding the tumor suppressor, under conditions 
permitting the uptake of the nucleic acid by the tumor cell. The subject may be a human. 

In still yet a further embodiment, there is provided transgenic mammal in which both 
copies of the gene encoding TS10q23.3 are interrupted or replaced with another gene. 

In an additional embodiment, there is provided a method of determining the stage of 
cancer comprising the steps of (i) obtaining a sample from a subject; and (ii) determining the 
expression a functional TS10q23.3 tumor suppressor in cells of the sample. The cancer may be a 
brain cancer and the stage is distinguished between low grade and glioma. The determining may 
comprise assaying for a TS10q23.3 nucleic acid or polypeptide in the sample. 

In yet an additional example, there is provided a method of predicting tumor metastasis 
comprising the steps of (i) obtaining a sample from a subject; and (ii) determining the expression 
a functional TS10q23.3 tumor suppressor in cells of the sample. The cancer may be 
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distinguished as metastatic and non-metastatic. The determining may comprise assaying for a 
TS 1 0q23 3 nucleic acid or polypeptide in the sample. 

In still yet an additional embodiment, there is provided a method of screening a candidate 
substance for anti-tumor activity comprising the steps of (i) providing a cell lacking functional 
TS10q23.3 polypeptide; (ii) contacting the cell with the candidate substance; and (iii) 
determining the effect of the candidate substance on the cell. The cell may be a tumor cell, for 
example, a tumor cell having a mutation in the coding region of TS 10q23.3.7. The mutation 
may be a deletion mutant, an insertion mutant, a frameshift mutant, a nonsense mutant, a 
missense mutant or splice mutant. The determining may comprise comparing one or more 
characteristics of the cell in the presence of the candidate substance with characteristics of a cell 
in the absence of the candidate substance. The characteristic may be TS10q23.3 expression, 
phosphatase activity, proliferation, metastasis, contact inhibition, soft agar growth, cell cycle 
regulation, tumor formation, tumor progression and tissue invasion. The candidate substance 
may be a chemotherapeutic or radiotherapeutic agent or be selected from a small molecule 
library. The cell may be contacted in vitro or in vivo. 

In still a further additional embodiment, there is provided a method of screening a 
candidate substance for anti-kinase activity comprising the steps of (i) providing a having 
TSl0q23.3 polypeptide comprising at least one tyrosine kinase site; (ii) contacting the cell with 
the candidate substance; and (iii) determining the effect of the candidate substance on the 
phosphorylation of the site. The determining may comprise comparing one or more 
characteristics of the cell in the presence of the candidate substance with characteristics of a cell 
in the absence of the candidate substance. The characteristic may be phosphorylation status of 
TS10q23.3, TS10q23.3 expression, phosphatase activity, proliferation, metastasis, contact 
inhibition, soft agar growth, cell cycle regulation, tumor formation, tumor progression and tissue 
invasion. The candidate substance may be a chemotherapeutic or radiotherapeutic agent or be 
selected from a small molecule library. The cell may be contacted in vitro or in vivo. 

BRIEF DESCRIPT ION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
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reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein: 

FIG. 1 - Localization of Candidate Tumor Suppressor Loci on Human Chromosome 10 . 
Various loci on the human chromosome 10 have been implicated as possible sites for tumor 
5 suppressing activity. These locations, and the reporting group, are depicted. 

FIG. 2 - Illustration of Homozygous Deletions in Glioma Cell Lines . Various glioma cell 
lines were screened for the presence of deletions in both copies of loci on chromosome 10. Loci 
are indicated on the vertical axis and cell lines are listed across the horizontal axis. Homozygous 
loss is indicated by a darkened oval. 

FIG. 3 - Illustration of Regions of Chromosome 10: Presence or Absence of DNA 
Microsatellite Markers in Hybrid Clone . Regions of chromosome 10 presence (solid circle) or 
absence (open circle) of DNA corresponding to chromosome 10 specific microsatellite markers 
from eleven subclones of the somatic cell hybrid clone U251.N10.7 that were transferred into 
mouse A9 cells are illustrated. The U251.N10.6 and U251.N10.8 somatic cell hybrids are fully 
suppressed clones, exhibiting no or little growth in soft agarose, and the U251.10.5A and C 
subclones are partially suppressed (Steck et aL, 1995). The difference between the fully 
suppressed clones and the partially suppressed clones provides a functional localization of the 
tumor suppressor gene. The possible regions that contain the tumor suppressor gene are 
indicated by the hatched boxes. The hatched box at 10q23.3 overlaps with the homozygous 
deletions and region implicated by allelic deletion analysis (see FIG. 2 and FIG. 4). 

FIG. 4 - Deletion Map of Chromosome 10 in Human Gliomas . The region bounded by 
the markers D10S551 to D10S583 are located in a 10 cM region. The microsatellites are shown 
in their order of most probably linkage and mapped to their approximate chromosomal location 
based on the radiation hybrid map as described by Gyapay et al, 1994. The region of 
25 chromosome 101 that is not involved in anaplastic astrocytomas and one glioma is shown in the 
boxed regions of the tumor. The critical region defined from the homozyogous deletion analysis 
and not excluded by this analysis is shown by the solid bar on the right side. 

FIG. 5 - Mapping of BAC 106dl6 . Mapping of the BAC designated 106dl6, and 
demonstration of homozygous deletion by Southern blotting is illustrated. The partial restriction 
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map of 106dl6 is depicted. The illustration of the blot shows the homozygous deletion of Eco 
band #14 (Mr approx. 1 1 kb) in EFC-2 cells. 

FIG - 6 ' Coding Sequence and 5'- a nd 3'-Flankinp Regions of TSin n n ? Coding 
region is in bold as is the first in frame stop codon. 

nG * 7 " Predicted Amino Acid Sequence of TS10q 23 3 p™h,«* Abbreviations are A, 
alanine; C, cysteine; D, aspartic acid; E, glutamic acid; F; phenylalanine; G, glycine; H, histidine; 
I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine; P, proline; Q, glutamine; R, 
arginine; S, serine; T, threonine; V, valine; W, tryptophan; Y, tyrosine. Phosphatase consensus 
site is in bold; tyrosine phosphorylation sites are italicized and underlined. 

FIG - 8 " Optional Analysis of 10q23 3 . Glioma line initially indicated as having 
homozygous deletions in 10q23.3 were reanalyzed for the presence of the TSl0q23.3 gene. 
Darkened oval indicates that the gene region is present; open oval indicates a homozygous 
deletion in the gene region. * - indicates exons trapped. 

FIG * 9 • Homology Comparison of Human TS10q 23 3 with Mouse and Doe Homnlnp s 
The initiation ATG codon and methionine amino acid are designated at the start (1) position. 
The termination codon is TGA (1210). Alterations between the human and mouse or dog 
sequences on the genomic or amino acid level are designated by a star in the sequence compared. 
However, no changes in the amino acid sequence were observed. 

FIG - 10 " Sequence of Exons and Surrounding Tntronic Reg i ons of TS10q ?3 i The 
exons are denoted as capital letters starting at position one, and introns are designated lower 
case letters; except for the first exon where the initiation codon starts at position one and the 3' 
exon/intron boundary is at position 79 and 80, respectively. The lower case letter designate 
(Table 4) corresponds to the numbering of the sequence presented in this figure, except for the 
first exon. The mutations for U87 and U138 are at the first intron G residue [G+1>T] after the 
exon (G and H, respectively). For T98G and KE, the point mutations are at positions 46 and 28 
of exon B, respectively. For LnCap cells, the mutation is a deletion of bases 16 and 17 in the 
first intron. 

FIGS. 11A-G - Analysis of Secondary Structures j n TS1023 3 FIG. 11 A: 
Hydrophilicity plot; FIG. 11B: Surface probability plot; FIG. 11C: Flexibility plot; FIG. 11D: 



WO 98/33907 PCT/US98/00353 

- 11 - 

Antigenic index plot; FIG. HE: Amphiphilic helix plot; FIG. 1 IF: Amphiphilic sheet plot; FIG. 
1 1G: Secondary structure plot. 

FIGS. 12A-I - Comparison of Predicted Characteristics in TS10q23.3 and Point Mutants 
T98G and KE. FIG. 12A: Hydrophilicity plot of residues 1-60 of wild-type polypeptide; FIG. 
5 12B: Surface probability plot of residues 1-60 of wild-type polypeptide; FIG. 12C: Secondary 
structure plot of residues 1-60 of wild-type polypeptide; FIG 12D: Hydrophilicity plot of 
residues 1-60 of KE mutant; FIG. 12E: Surface probability plot of residues 1-60 of KE mutant; 
FIG. 12 F: Secondary structure plot of residues 1-60 of KE mutant; FIG. 12G: Hydrophilicity 
plot of residues 1-60 of T98G mutant; FIG. 12H: Surface probability plot of residues 1-60 of 
10 T98G mutant; FIG. 121: Secondary structure plot of residues 1-60 of T98G mutant. The T98G 
mutation (Leu — » Arg) at residue 42 results in the loss of proposed helix secondary structure of 
TS10q23.3. The mutation in KE at residue 36 (Gly — » Glu) would significantly increase the 
length of the proposed helical structure in the region. Both mutations would affect the same 
helical structure. Also, minor changes in the hydrophilicity and surface probability arise. 

15 SEQUENCE SUMMARY 

SEQ ID NO:l=predicted sequence for TS10Q23.3; SEQ LD NO:2=human gene 
sequence; SEQ ED NO:3=mouse gene sequence; SEQ ID NO:4=dog gene sequence; SEQ ID 
NO:5=human peptide sequence; SEQ ID NO:6=mouse peptide sequence; SEQ ID NO:7=dog 
peptide sequence; SEQ ID NO:8=exon a; SEQ ID NO:9=exon b; SEQ ID NO: 10=exon c; SEQ 

20 ID NO:l 1 = exon d; SEQ ID NO:12 = exon e; SEQ ID NO:13 = exon f; SEQ ID NO:14 = exon 
g; SEQ ID NO: 15 = exon h; SEQ ID NO 16 = exon I; SEQ ID NO: 17 = a motif from residues 
88 to 98; SEQ ID NO: 18 = conserved catalytic domain of a protein tyrosine phosphatase (Denu 
et ai y 1996); SEQ ID NO: 19 residues 1-60 of wild-type TS10q23.3 polypeptide; SEQ ID 
NO:20 residues 1-60 of T98G mutant TS10q23.3 polypeptide; SEQ ID NO:21 residues 1-60 of 

25 KE mutant TS 1 0q23 3 polypeptide. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

I. The Present Invention 

As stated above, a number of different groups have shown evidence of a tumor 
suppressing activity associated with the lOq region of human chromosome 10. Despite this 
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considerable amount of work, the identity of the gene or genes responsible for this activity has 
not been determined. Previous used a functional approach involving transfer of chromosomes 
or chromosomal fragments suspected of harboring tumor suppressor gene(s) into tumorigenic 
glioma cells. These efforts allowed definition of the biological activity of putative tumor 
suppressor gene(s) and aid in the localization of such. Chromosomes 2 and 10 were transferred 
into U251 glioma cells and chromosomes 2 and 10 into LG-11 cells. The LG-11 cells were 
shown to have no intact copies of chromosome 10 and the breakpoint was subsequently found to 
occur at 10q24. The transfer of chromosome 10 resulted in hybrid cells that displayed a 
suppressed phenotype, exhibiting a loss of tumorigenicity (no tumor formation) and loss of the 
ability to grow in soft agarose (SOX to 1000X decrease; Pershouse et al, 1993). The hybrid's 
exponential growth rate was similar to the parental cell, although the hybrid cell's saturation 
density was significantly (10X to 20X) lower than the parental cells. The transfer of 
chromosome 2 resulted in hybrid cells that acted similar to the parental cells. 

One objective of these studies was to localize the chromosome 10 suppressor gene by 
fragmentation of the neomycin-tagged chromosome 10 and, subsequently, to transfer the 
fragmented chromosome into glioma cells. However, the inventors observed that some of the 
hybrid cells had spontaneously undergone chromosomal rearrangements to yield hybrid cells 
retaining only various regions of the inserted chromosome 10 (Pershouse et al, 1993). The 
inventors then subcloned the hybrids and analyzed them, rather than initiate fragmentation 
studies (Steck et al, 1995). The retention of the inserted chromosome 10 or its fragments was 
tracked by informative RFLP markers and FISH analysis. Interestingly, only the inserted 
chromosome was subjected to rearrangement. The insertion of an entire copy of chromosome 
10 resulted in inhibition of the hybrid cell's transformed property to proliferate in soft agarose 
and to form tumors in nude mice. 

These two phenotypes now appear to be partially separable by the instant analysis Some 
subclones (U251.NlO.5a-j), which revealed a loss of a major portion of the long arm of 
chromosome 10, grew in soft agarose but failed to form tumors in nude mice, thus indicating 
that a tumor suppressive locus resides in the remaining portion of the chromosome (lOpter to 
lOqll). In contrast, clones that retained a distal region of the long arm, 10q24 to 10q26, failed 
both to proliferate m soft agarose and in nude mice (see FIG. 4). This suggests Another 
phenotypic suppressive region residing in the distal region of the chromosome. The lack of 
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additional 1 0-associated material further would suggest that the remaining chromosome 10 
material is responsible for the altered biological phenotype. These results implicate the presence 
of two phenotypically independent suppressive regions on chromosome 10 involved in glioma 
progression (Steck et al, 1995). 

According to the present invention, the inventors now have used several independent 
strategies to localize a tumor suppressor gene, designated TS10q23.3, that is involved in 
gliomas, breast cancer, prostate cancer and other cancers. These approaches, described in 
greater detail in the following Examples, included (i) identification of homozygous deletions in a 
series of human glioma cell lines; (ii) determination of a consistent region(s) of retention in 
clones suppressed for turaorigenicity; and (iii) allelic deletion studies on various grades of glioma 
and corresponding normal samples. With the gene in hand, it now becomes possible to exploit 
the information encoded by the gene to develop novel diagnostic and therapeutic approaches 
related to human cancer. 



II. The 10q23.3 Tumor Suppressor 

According to the present invention, there has been identified a tumor suppressor, 
encoded by a gene in the 10q23.3 locus, and designated here as TS10q23.3. This molecule is 
capable of suppressing tumor phenotypes in various cancers. The term tumor suppressor is well- 
known to those of skill in the art. Examples of other tumors suppressors are p53, Rb and pi 6, 
to name a few. While these molecules are structurally distinct, they form group of functionally- 
related molecules, of which TS10q23.3 is a member. The uses for which these other tumor 
suppressors now are being exploited are equally applicable here. 

In addition to the entire TS10q23.3 molecule, the present invention also relates to 
fragments of the polypeptide that may or may not retain the tumor suppressing (or other) 
activity. Fragments including the N-terminus of the molecule may be generated by genetic 
engineering of translation stop sites within the coding region (discussed below). Alternatively, 
treatment of the TS10q23.3 molecule with proteolytic enzymes, known as protease, can 
produces a variety of N-terminal, C-terminal and internal fragments. Examples of fragments may 
include contiguous residues of the TS10q23.3. sequence given in FIG. 7 and FIG. 9, of 6, 7, 8, 
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9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 
75, 80, 85, 90, 95, 100, 200, 300, 400 or more amino acids in length. These fragments may be 
purified according to known methods, such as precipitation (e.g., ammonium sulfate), HPLC, 
ion exchange chromatography, affinity chromatography (including immunoaffinity 
chromatography) or various size separations (sedimentation, gel electrophoresis, gel filtration). 

A . Structural Features of the Polypeptide 

The gene for TS10q23.3 encodes a 403 amino acid polypeptide. The predicted 
molecular weight of this molecule is 47,122, with a resulting pi of 5.86. Thus, at a minimum, this 
molecule may be used as a standard in assays where molecule weight and pi are being examined. 

A phosphatase consensus site located at residues 122-131, matching perfectly the 
tyrosine phosphatase (PTP) consensus sequence: [W]HCxAGxxR[S/T]G. Outside the active 
domains, sequences differ greatly. PTPs proceed through phosphoenzyme intermediates. The 
enzymatic reaction involves phosphoryl-cysteine intermediate formation after nucleophilic attack 
of the phosphorus atom of the substrate by the thiolate anion of cysteine. The reaction can be 
represented as a two-step chemical process: phosphoryl transfer to the enzyme accompanied by 
rapid release of dephosphorylated product; and hydrolysis of the thiol-phosphate intermediate 
concomitant with rapid release of phosphate. To form the catalytically competent component 
complex, the enzyme binds and reacts with the dianion of phosphate-containing substrate. On 
the enzyme, an aspartic acid must be protonated and the nucleophilic cysteine must be 
unprotonated (thiolate anion) for phosphoryl transfer to the enzyme. Also of note are potential 
tyrosine phosphorylation sites located at residues 233-240 and 308-315 and cAMP 
phosphorylation sites located at residues 128, 164, 223 and 335. Phosphatases are known to 
have kinase sites, and the phosphatase activity of these enzymes can be modulated by 
phosphorylation at these sites. Protein phosphatases generally are divided into two categories - 
serine/threonine phosphatases and tyrosine phosphatases. Certain of the tyrosine phosphatases 
also have activity against phosphoserine and phosphothreonine. 

The interaction between kinases and phophatases, and the various phosphorylation states 
of polypeptides, have been demonstrated as important features in cell physiology. Through a 
variety of different mechanisms, kinases and phophatases act in different pathways within cells 
that are involved in signaling, energy storage and cell regulation. Since the identification of an 



WO 98/33907 



- 15- 



PCT/US98/00353 



intrinsic tyrosine kinase function in the transforming protein src (Collett & Erickson, 1978), the 
role of phosphorylation, particularly on tyrosine residues, has been demonstrated to play a 
critical role in the control of cellular proliferation and the induction of cancer (Hunter, 1991; 
Bishop, 1991). The roles that protein phosphatases play in growth regulation, as well as in many 
5 other biological and biochemical activities, has been correlated with the phosphrylation state of 
biologically important molecules (Cohen, 1994). 

It also should be mentioned that the 60 or so amino acids of the N-terminus of the 
molecule show some homology to tensin, a cytoskeletal protein implicated in adhesion plaques. 
This suggest that TS10q23.3 maybe involved cell surface phenomena such as contact inhibition, 
10 invasion, migration or cell-to-cell signaling. TS10q23.3 point mutations identified in certain 
tumor cell lines affect secondary proposed structures in this region. 

B. Functional Aspects 

When the present application refers to the function of TS10q23.3 or "wild-type" activity, 
it is meant that the molecule in question has the ability to inhibit the transformation of cell from a 
15 normally regulated state of proliferation to a malignant state, i.e., one associated with any sort of 
abnormal growth regulation, or to inhibit the transformation of a cell from an abnormal state to a 
highly malignant state, e.g., to prevent metastasis or invasive tumor growth. Other phenotypes 
that may be considered to be regulated by the normal TS10q23.3 gene product are angiogenesis, 
adhension, migration, cell-to-cell signaling, cell growth, cell proliferation, density-dependent 
20 growth, anchorage-dependent growth and others. Determination of which molecules possess 
this activity may be determined using assays familiar to those of skill in the art. For example, 
transfer of genes encoding TS10q23.3, or variants thereof, into cells that do not have a 
functional TS10q23.3 product, and hence exhibit impaired growth control, will identify, by 
virtue of growth suppression, those molecules having TS10q23.3 function. 

As stated above, there is an indication that TS10q23.3 is a phosphatase. The portion of 
the protein located at residues 88 to 98 is a perfect match for the conserved catalytic domain of 
protein tyrosine phosphatase. Also, putative kinase targets are located in the molecule, which is 
another characteristic of phosphatases. Because other tumor suppressors have been identified 
with this sort of activity, it will be desirable to determine the phosphatase function in the tumor 
suppressing role of TS10q23.3. This also may be a fruitful approach to developing screening 
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assays for the absence of TS10q23.3 function or in the development of cancer therapies, for 
example, in targeting the phosphatase function of TS10q23.3, targeting the substrate upon which 
TS10q23.2 acts, and/or targeting the kinase or kinases which act upon TS10q23.3. 

C. Variants of TS1 0q23. 3 

5 Amino acid sequence variants of the polypeptide can be substitutional, insertional or deletion 

variants. Deletion variants lack one or more residues of the native protein which are not essential for 
function or immunogenic activity, and are exemplified by the variants lacking a transmembrane 
sequence described above. Another common type of deletion variant is one lacking secretory signal 
sequences or signal sequences directing a protein to bind to a particular part of a cell.. Insertional 
1 0 mutants typically involve the addition of material at a non-terminal point in the polypeptide. This may 
include the insertion of an immunoreactive epitope or simply a single residue. Terminal additions, 
called fusion proteins, are discussed below. 

Substitutional variants typically contain the exchange of one amino acid for another at one or 
more sites within the protein, and may be designed to modulate one or more properties of the 

15 polypeptide, such as stability against proteolytic cleavage, without the loss of other functions or 
properties. Substitutions of this kind preferably are conservative, that is, one amino acid is replaced 
with one of similar shape and charge. Conservative substitutions are well known in the art and 
include, for example, the changes of: alanine to serine; arginine to lysine; asparagine to glutamine or 
histidine; aspartate to glutamate; cysteine to serine; glutamine to asparagine; glutamate to aspartate; 

20 glycine to proline; histidine to asparagine or glutamine; isoleucine to leucine or valine; leucine to 
valine or isoleucine; lysine to arginine; methionine to leucine or isoleucine; phenylalanine to tyrosine, 
leucine or methionine; serine to threonine; threonine to serine; tryptophan to tyrosine; tyrosine to 
tryptophan or phenylalanine; and valine to isoleucine or leucine. 

The following is a discussion based upon changing of the amino acids of a protein to create an 
25 equivalent, or even an improved, second-generation molecule. For example, certain amino acids may 
be substituted for other amino acids in a protein structure without appreciable loss of interactive 
binding capacity with structures such as, for example, antigen-binding regions of antibodies or binding 
sites on substrate molecules. Since it is the interactive capacity and nature of a protein that defines 
that protein's biological functional activity, certain amino acid substitutions can be made in a protein 
30 sequence, and its underlying DNA coding sequence, and nevertheless obtain a protein with like 
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properties. It is thus contemplated by the inventors that various changes may be made in the DNA 
sequences of genes without appreciable loss of their biological utility or activity, as discussed below. 
Table 1 shows the codons that encode particular amino acids. 

In making such changes, the hydropathic index of amino acids may be considered. The 
5 importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte & Doolittle, 1982). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the resultant 
protein, which in turn defines the interaction of the protein with other molecules, for example, 
enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. 

10 Each amino acid has been assigned a hydropathic index on the basis of their 

hydrophobicity and charge characteristics (Kyte & Doolittle, 1982), these are: isoleucine (+4.5); 

valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); 

alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); 

proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (- 
15 3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar biological 
activity, i.e., still obtain a biological functionally equivalent protein. In making such changes, the 
substitution of amino acids whose hydropathic indices are within ±2 is preferred, those which are 
20 within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be made 
effectively on the basis of hydrophilicity. U.S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. 
25 As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been assigned to 
amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); 
serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 
1); alanine (-0.5); histidine *-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); 
isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 
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It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity value and still obtain a biologically equivalent and immunologically equivalent 
protein. In such changes, the substitution of amino acids whose hydrophilicity values are within 
±2 is preferred, those that are within ±1 are particularly preferred, and those within ±0.5 are 
even more particularly preferred. 

As outlined above, amino acid substitutions are generally based on the relative similarity 
of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, 
charge, size, and the like. Exemplary substitutions that take various of the foregoing 
characteristics into consideration are well known to those of skill in the art and include: arginine 
and lysine; glutamate and aspartate; serine and threonine; glutamine and asparagine; and valine, 
leucine and isoleucine. 

Another embodiment for the preparation of polypeptides according to the invention is the use 
of peptide mimetics. Mimetics are peptide-containing molecules that mimic elements of protein 
secondary structure. See, for example, Johnson et al, "Peptide Turn Mimetics" in 
BIOTECHNOLOGY AND PHARMACY, Pezzuto et al., Eds., Chapman and Hall, New York (1993). 
The underlying rationale behind the use of peptide mimetics is that the peptide backbone of proteins 
exists chiefly to orient amino acid side chains in such a way as to facilitate molecular interactions, such 
as those of antibody and antigen. A peptide mimetic is expected to permit molecular interactions 
similar to the natural molecule. These principles may be used, in conjunction with the principles 
outline above, to engineer second generation molecules having many of the natural properties of 
TS10q23.3, but with altered and even improved characteristics. 

D. Domain Switching 

As described in the examples, the present inventors have identified putative murine and 
canine homologs of the human TSl0q23.3 gene. In addition, mutations have been identified in 
TS10q23.3 which are believed to alter its function. These studies are important for at least two 
reasons. First, they provides a reasonable expectation that still other homologs, allelic variants 
and mutants of this gene may exist in related species, such as rat, rabbit, monkey, gibbon, chimp, 
ape, baboon, cow, pig, horse, sheep and cat. Upon isolation of these homologs, variants and 
mutants, and in conjunction with other analyses, certain active or functional domains can be 
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identified. Second, this will provide a starting point for further mutational analysis of the 
molecule. One way in which this information can be exploited is in "domain switching." 

Domain switching involves the generation of chimeric molecules using different but, in 
this case, related polypeptides. By comparing the mouse, dog and human sequences for 
5 TS10q23.3 with the TS10q23.3 of other species, and with mutants and allelic variants of these 
polypeptides, one can make predictions as to the functionally significant regions of these 
molecules. It is possible, then, to switch related domains of these molecules in an effort to 
determine the criticality of these regions to TS10q23.3 function. These molecules may have 
additional value in that these "chimeras" can be distinguished from natural molecules, while 
10 possibly providing the same function. 

Based on the sequence identity, at the amino acid level, of the mouse, dog and human 
sequences, it may be inferred that even small changes in the primary sequence of the molecule 
will affect function. Further analysis of mutations and their predicted effect on secondary 
structure will add to this understanding. 

15 Another structural aspect of TS10q23.3 that provides fertile ground for domain 

switching experiments is the tyrosine phosphatase-like domain and the putative tyrosine 
phosphorylation sites. This domain may be substituted for other phosphatase domains in order 
to alter the specificity of this function. A further investigation of the homology between 
TS10q23.3 and other phosphatases is warranted by this observation. 

20 E. Fusion Proteins 

A specialized kind of insertional variant is the fusion protein. This molecule generally has all 
or a substantial portion of the native molecule, linked at the N- or C-terminus, to all or a portion of a 
second polypeptide. For example, fusions typically employ leader sequences from other species to 
permit the recombinant expression of a protein in a heterologous host. Another useful fusion includes 
25 the addition of a immunologically active domain, such as an antibody epitope, to facilitate purification 
of the fusion protein. Inclusion of a cleavage site at or near the fusion junction will facilitate removal 
of the extraneous polypeptide after purification. Other useful fusions include linking of functional 
domains, such as active sites from enzymes, glycosylation domains, cellular targeting signals or 
transmembrane regions. One particular fusion of interest would include a deletion construct lacking 
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the phosphatase site of TS10q23.3 but containing other regions that could bind the substrate 
molecule. Fusion to a polypeptide that can be used for purification of the substrate-TS10q23.3 
complex would serve to isolated the substrate for identification and analysis. 

K Purification of Proteins 

5 It will be desirable to purify TS10q23.3 or variants thereof. Protein purification 

techniques are well known to those of skill in the art. These techniques involve, at one level, the 
crude fractionation of the cellular milieu to polypeptide and non-polypeptide fractions. Having 
separated the polypeptide from other proteins, the polypeptide of interest may be further purified 
using chromatographic and electrophoretic techniques to achieve partial or complete purification 
10 (or purification to homogeneity). Analytical methods particularly suited to the preparation of a 
pure peptide are ion-exchange chromatography, exclusion chromatography; polyacrylamide gel 
electrophoresis; isoelectric focusing. A particularly efficient method of purifying peptides is fast 
protein liquid chromatography or even HPLC. 

Certain aspects of the present invention concern the purification, and in particular 
15 embodiments, the substantial purification, of an encoded protein or peptide. The term "purified 
protein or peptide" as used herein, is intended to refer to a composition, isolatable from other 
components, wherein the protein or peptide is purified to any degree relative to its naturally- 
obtainable state. A purified protein or peptide therefore also refers to a protein or peptide, free 
from the environment in which it may naturally occur. 

20 Generally, "purified" will refer to a protein or peptide composition that has been 

subjected to fractionation to remove various other components, and which composition 
substantially retains its expressed biological activity. Where the term "substantially purified" is 
used, this designation will refer to a composition in which the protein or peptide forms the major 
component of the composition, such as constituting about 50%, about 60%, about 70%, about 

25 80%, about 90%, about 95% or more of the proteins in the composition. 

Various methods for quantifying the degree of purification of the protein or peptide will 
be known to those of skill in the art in light of the present disclosure. These include, for 
example, determining the specific activity of an active fraction, or assessing the amount of 
polypeptides within a fraction by SDS/PAGE analysis. A preferred method for assessing the 
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purity of a fraction is to calculate the specific activity of the fraction, to compare it to the 
specific activity of the initial extract, and to thus calculate the degree of purity, herein assessed 
by a "-fold purification number." The actual units used to represent the amount of activity will, 
of course, be dependent upon the particular assay technique chosen to follow the purification 
5 and whether or not the expressed protein or peptide exhibits a detectable activity. 

Various techniques suitable for use in protein purification will be well known to those of 
skill in the art. These include, for example, precipitation with ammonium sulphate, PEG, 
antibodies and the like or by heat denaturation, followed by centrifugation; chromatography 
steps such as ion exchange, gel filtration, reverse phase, hydroxylapatite and affinity 
10 chromatography; isoelectric focusing; gel electrophoresis; and combinations of such and other 
techniques. As is generally known in the art, it is believed that the order of conducting the 
various purification steps may be changed, or that certain steps may be omitted, and still result in 
a suitable method for the preparation of a substantially purified protein or peptide. 

There is no general requirement that the protein or peptide always be provided in their 
15 most purified state. Indeed, it is contemplated that less substantially purified products will have 
utility in certain embodiments. Partial purification may be accomplished by using fewer 
purification steps in combination, or by utilizing different forms of the same general purification 
scheme. For example, it is appreciated that a cation-exchange column chromatography 
performed utilizing an HPLC apparatus will generally result in a greater "-fold" purification than 
20 the same technique utilizing a low pressure chromatography system. Methods exhibiting a lower 
degree of relative purification may have advantages in total recovery of protein product, or in 
maintaining the activity of an expressed protein. 

It is known that the migration of a polypeptide can vary, sometimes significantly, with 
different conditions of SDS/PAGE (Capaldi et al t 1977). It will therefore be appreciated that 
25 under differing electrophoresis conditions, the apparent molecular weights of purified or partially 
purified expression products may vary. 

High Performance Liquid Chromatography (HPLC) is characterized by a very rapid 
separation with extraordinary resolution of peaks. This is achieved by the use of very fine 
particles and high pressure to maintain an adequate flow rate. Separation can be accomplished in 
30 a matter of minutes, or at most an hour. Moreover, only a very small volume of the sample is 
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needed because the particles are so small and close-packed that the void volume is a very small 
fraction of the bed volume. Also, the concentration of the sample need not be very great 
because the bands are so narrow that there is very little dilution of the sample. 

Gel chromatography, or molecular sieve chromatography, is a special type of partition 
5 chromatography that is based on molecular size. The theory behind gel chromatography is that 
the column, which is prepared with tiny particles of an inert substance that contain small pores, 
separates larger molecules from smaller molecules as they pass through or around the pores, 
depending on their size. As long as the material of which the particles are made does not adsorb 
the molecules, the sole factor determining rate of flow is the size. Hence, molecules are eluted 
10 from the column in decreasing size, so long as the shape is relatively constant. Gel 
chromatography is unsurpassed for separating molecules of different size because separation is 
independent of all other factors such as pH, ionic strength, temperature, etc. There also is 
virtually no adsorption, less zone spreading and the elution volume is related in a simple matter 
to molecular weight. 

15 Affinity Chromatography is a chromatographic procedure that relies on the specific 

affinity between a substance to be isolated and a molecule that it can specifically bind to. This is 
a receptor-ligand type interaction. The column material is synthesized by covalently coupling 
one of the binding partners to an insoluble matrix. The column material is then able to 
specifically adsorb the substance from the solution. Elution occurs by changing the conditions to 

20 those in which binding will not occur (alter pH, ionic strength, temperature, etc.). 

A particular type of affinity chromatography useful in the purification of carbohydrate 
containing compounds is lectin affinity chromatography. Lectins are a class of substances that 
bind to a variety of polysaccharides and glycoproteins. Lectins are usually coupled to agarose by 
cyanogen bromide. Conconavalin A coupled to Sepharose was the first material of this sort to 

25 be used and has been widely used in the isolation of polysaccharides and glycoproteins other 
lectins that have been include lentil lectin, wheat germ agglutinin which has been useful in the 
purification of N-acetyl glucosaminyl residues and Helix pomatia lectin. Lectins themselves are 
purified using affinity chromatography with carbohydrate ligands. Lactose has been used to 
purify lectins from castor bean and peanuts; maltose has been useful in extracting lectins from 

30 lentils and jack bean; N-acetyl-D galactosamine is used for purifying lectins from soybean; N- 
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acetyl glucosaminyl binds to lectins from wheat germ; D-galactosamine has been used in 
obtaining lectins from clams and L-fucose will bind to lectins from lotus. 

The matrix should be a substance that itself does not adsorb molecules to any significant 
extent and that has a broad range of chemical, physical and thermal stability. The ligand should 
be coupled in such a way as to not affect its binding properties. The ligand should also provide 
relatively tight binding. And it should be possible to elute the substance without destroying the 
sample or the ligand. One of the most common forms of affinity chromatography is 
immunoaffinity chromatography. The generation of antibodies that would be suitable for use in 
accord with the present invention is discussed below. 

G. Synthetic Peptides 

The present invention also describes smaller TS10q23.3 -related peptides for use in 
various embodiments of the present invention. Because of their relatively small size, the peptides 
of the invention can also be synthesized in solution or on a solid support in accordance with 
conventional techniques. Various automatic synthesizers are commercially available and can be 
used in accordance with known protocols. See, for example, Stewart and Young, (1984); Tarn 
et aL, (1983); Merrifield, (1986); and Barany and Merrifield (1979), each incorporated herein by 
reference. Short peptide sequences, or libraries of overlapping peptides, usually from about 6 up 
to about 35 to 50 amino acids, which correspond to the selected regions described herein, can be 
readily synthesized and then screened in screening assays designed to identify reactive peptides. 
Alternatively, recombinant DNA technology may be employed wherein a nucleotide sequence 
which encodes a peptide of the invention is inserted into an expression vector, transformed or 
transfected into an appropriate host cell and cultivated under conditions suitable for expression. 

H. Antigen Compositions 

The present invention also provides for the use of TS10q23.3 proteins or peptides as 
antigens for the immunization of animals relating to the production of antibodies. It is 
envisioned that either TS10q23.3, or portions thereof, will be coupled, bonded, bound, 
conjugated or chemically-linked to one or more agents via linkers, polylinkers or derivatized 
amino acids. This may be performed such that a bispecific or multivalent composition or vaccine 
is produced. It is further envisioned that the methods used in the preparation of these 
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compositions will be familiar to those of skill in the art and should be suitable for administration 
to animals, i.e., pharmaceutical^ acceptable. Preferred agents are the carriers are keyhole limpet 
hemocyannin (KLH) or bovine serum albumin (BSA). 



m. Nucleic Acids 

The present invention also provides, in another embodiment, genes encoding TS10q23.3. 
Genes for the human, canine and murine TS10q23.3 molecule have been identified. The present 
invention is not limited in scope to these genes, however, as one of ordinary skill in the could, 
using these two nucleic acids, readily identify related homologs in various other species (e.g., m, 
rabbit, monkey, gibbon, chimp, ape, baboon, cow, pig, horse, sheep, cat and other species). The 
finding of mouse and dog homologs for this gene makes it likely that other species more closely 
related to humans will, in fact, have a homolog as well. 

In addition, it should be clear that the present invention is not limited to the specific 
nucleic acids disclosed herein. As discussed below, a «TS10q23.3 gene" may contain a variety 
of different bases and yet still produce a corresponding polypeptides that is functionally 
^distinguishable, and in some cases structurally, from the human and mouse genes disclosed 
herein. 

Similarly, any reference to a nucleic acid should be read as encompassing a host cell 
containing that nucleic acid and, in some cases, capable of expressing the product of that nucleic 
acid. In addition to therapeutic considerations, cells expressing nucleic acids of the present 
invention may prove useful in the context of screening for agents that induce, repress, inhibit, 
augment, interfere with, block, abrogate, stimulate or enhance the function of TS 1 0q23.3 

A. Nucleic Acids Encoding J 0q23.3 

The human gene disclosed in FIGS. 6 and 9, and the murine gene disclosed in FIG. 9 are 
TS10q23.3 genes of the present invention. Nucleic acids according to the present invention may 
encode an entire TSl0q23.3 gene, a domain of TS10q23.3 that expresses a tumor suppressing or 
phosphatase function, or any other fragment of the TS10q23.3 sequences set forth herein. The 
nucleic acid may be derived from genomic DNA, /.*., cloned directly from the genome of a particular 



WO 98/33907 PCTAJS98/00353 

-25- 

organism. In preferred embodiments, however, the nucleic acid would comprise complementary 
DNA (cDNA). Also contemplated is a cDNA plus a natural intron or an intron derived from another 
gene; such engineered molecules are sometime referred to as "mini-genes." At a minimum, these and 
other nucleic acids of the present invention may be used as molecular weight standards in, for 
example, gel electrophoresis. 

The term "cDNA" is intended to refer to DNA prepared using messenger RNA (mRNA) as 
template. The advantage of using a cDNA, as opposed to genomic DNA or DNA polymerized from 
a genomic, non- or partially-processed RNA template, is that the cDNA primarily contains coding 
sequences of the corresponding protein. There may be times when the full or partial genomic 
sequence is preferred, such as where the non-coding regions are required for optimal expression or 
where non-coding regions such as introns are to be targeted in an antisense strategy. 

It also is contemplated that a given TS10q23.3 from a given species may be represented by 
natural variants that have slightly different nucleic acid sequences but, nonetheless, encode the same 
protein (see Table 1 below). 

As used in this application, the term "a nucleic acid encoding a TS10q23.3" refers to a nucleic 
acid molecule that has been isolated free of total cellular nucleic acid. In preferred embodiments, the 
invention concerns a nucleic acid sequence essentially as set forth in FIGS. 6 and 9. The term "as set 
forth in FIG. 6 or 9" means that the nucleic acid sequence substantially corresponds to a portion of 
FIG. 6 or 9. The term "functionally equivalent codon" is used herein to refer to codons that encode 
the same amino acid, such as the six codons for arginine or serine (Table 1, below), and also refers to 
codons that encode biologically equivalent amino acids, as discussed in the following pages. 
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TABLE 1 



Amino Acids 


Codons 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GOTT 

\J\J\J 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


lie 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cur 

\^ y^J 




PTTT T 

tuu 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGCAGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











Allowing for the degeneracy of the genetic code, sequences that have at least about 50%, 
usually at least about 60%, more usually about 70%, most usually about 80%, preferably at least 
about 90% and most preferably about 95% of nucleotides that are identical to the nucleotides of FIG. 
9 will be sequences that are "as set forth in FIG. 9." Sequences that are essentially the same as those 
set forth in FIG. 9 may also be functionally defined as sequences that are capable of hybridizing to a 
nucleic acid segment containing the complement of FIG. 9 under standard conditions. 
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The DNA segments of the present invention include those encoding biologically functional 
equivalent TS10q23.3 proteins and peptides, as described above. Such sequences may arise as a 
consequence of codon redundancy and amino acid functional equivalency that are known to occur 
naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, functionally 
5 equivalent proteins or peptides may be created via the application of recombinant DNA technology, in 
which changes in the protein structure may be engineered, based on considerations of the properties 
of the amino acids being exchanged. Changes designed by man may be introduced through the 
application of site-directed mutagenesis techniques or may be introduced randomly and screened later 
for the desired function, as described below. 

10 B. Oligonucleotide Probes and Primers 

Naturally, the present invention also encompasses DNA segments that are complementary, or 
essentially complementary, to the sequence set forth in FIGS. 6 and 9. Nucleic acid sequences that 
are "complementary 11 are those that are capable of base-pairing according to the standard Watson- 
Crick complementary rules. As used herein, the term "complementary sequences" means nucleic acid 
15 sequences that are substantially complementary, as may be assessed by the same nucleotide 
comparison set forth above, or as defined as being capable of hybridizing to the nucleic acid segment 
of FIGS. 6 and 9 under relatively stringent conditions such as those described herein. Such sequences 
may encode the entire TS10q23.3 protein or functional or non-functional fragments thereof. 

20 Alternatively, the hybridizing segments may be shorter oligonucleotides. Sequences of 17 

bases long should occur only once in the human genome and, therefore, suffice to specify a unique 
target sequence. Although shorter oligomers are easier to make and increase in vivo accessibility, 
numerous other factors are involved in determining the specificity of hybridization. Both binding 
affinity and sequence specificity of an oligonucleotide to its complementary target increases with 

25 increasing length. It is contemplated that exemplary oligonucleotides of 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more base pairs 
will be used, although others are contemplated. Longer polynucleotides encoding 250, 500, 1000, 
1212, 1500, 2000, 2500, 3000 or 3431 bases and longer are contemplated as well. Such 
oligonucleotides will find use, for example, as probes in Southern and Northern blots and as primers 

30 in amplification reactions. 
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Suitable hybridization conditions will be well known to those of skill in the art. In certain 
applications, for example, substitution of amino acids by site-directed mutagenesis, it is appreciated 
that lower stringency conditions are required. Under these conditions, hybridization may occur even 
though the sequences of probe and target strand are not perfectly complementary, but are mismatched 
5 at one or more positions. Conditions may be rendered less stringent by increasing salt concentration 
and decreasing temperature. For example, a medium stringency condition could be provided by about 
0.1 to 0.25 M NaCl at temperatures of about 37°C to about 55°C, while a low stringency condition 
could be provided by about 0.15 M to about 0.9 M salt, at temperatures ranging from about 20°C to 
about 55°C. Thus, hybridization conditions can be readily manipulated, and thus will generally be a 
1 0 method of choice depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions o£ for example, 50 
mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgCl 2 , 10 mM dithiothreitol, at temperatures between 
approximately 20°C to about 37°C. Other hybridization conditions utilized could include 
15 approximately 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 |jM MgCI 2 , at temperatures ranging from 
approximately 40°C to about 72°C. Formamide and SDS also may be used to alter the hybridization 
conditions. 

One method of using probes and primers of the present invention is in the search for genes 
related to TS10q23.3 or, more particularly, homologs of TS10q23.3 from other species. The 
20 existence of a murine homolog strongly suggests that other homologs of the human TS10q23.3 will 
be discovered in species more closely related, and perhaps more remote, than mouse. Normally, the 
target DNA will be a genomic or cDNA library, although screening may involve analysis of RNA 
molecules. By varying the stringency of hybridization, and the region of the probe, different degrees 
of homology may be discovered. 

25 Another way of exploiting probes and primers of the present invention is in site-directed, 

or site-specific mutagenesis. Site-specific mutagenesis is a technique useful in the preparation of 
individual peptides, or biologically functional equivalent proteins or peptides, through specific 
mutagenesis of the underlying DNA. The technique further provides a ready ability to prepare 
and test sequence variants, incorporating one or more of the foregoing considerations, by 

30 introducing one or more nucleotide sequence changes into the DNA. Site-specific mutagenesis 
allows the production of mutants through the use of specific oligonucleotide sequences which 
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encode the DNA sequence of the desired mutation, as well as a sufficient number of adjacent 
nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a 
stable duplex on both sides of the deletion junction being traversed. Typically, a primer of about 
17 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides of the 
5 junction of the sequence being altered. 

The technique typically employs a bacteriophage vector that exists in both a single 
stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include 
vectors such as the M13 phage. These phage vectors are commercially available and their use is 
generally well known to those skilled in the art. Double stranded plasmids are also routinely 
10 employed in site directed mutagenesis, which eliminates the step of transferring the gene of 
interest from a phage to a plasmid. 

In general, site-directed mutagenesis is performed by first obtaining a single-stranded 
vector, or melting of two strands of a double stranded vector which includes within its sequence 
a DNA sequence encoding the desired protein. An oligonucleotide primer bearing the desired 

15 mutated sequence is synthetically prepared. This primer is then annealed with the single-stranded 
DNA preparation, taking into account the degree of mismatch when selecting hybridization 
conditions, and subjected to DNA polymerizing enzymes such as E. colt polymerase I Klenow 
fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a 
heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the 

20 second strand bears the desired mutation. This heteroduplex vector is then used to transform 
appropriate cells, such as E. coli cells, and clones are selected that include recombinant vectors 
bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected gene using site-directed mutagenesis 
is provided as a means of producing potentially useful species and is not meant to be limiting, as 
25 there are other ways in which sequence variants of genes may be obtained. For example, 
recombinant vectors encoding the desired gene may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. 
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C. Antisense Constructs 

In some cases, mutant tumor suppressors may not be non-functional. Rather, they may 
have aberrant functions that cannot be overcome by replacement gene therapy, even where the 
"wild-type" molecule is expressed in amounts in excess of the mutant polypeptide. Antisense 
treatments are one way of addressing this situation. Antisense technology also may be used to 
"knock-out" function of TS10q23.3 in the development of cell lines or transgenic mice for 
research, diagnostic and screening purposes. 

Antisense methodology takes advantage of the fact that nucleic acids tend to pair with 
"complementary" sequences. By complementary, it is meant that polynucleotides are those 
which are capable of base-pairing according to the standard Watson-Crick complementarity 
rules. That is, the larger purines will base pair with the smaller pyrimidines to form combinations 
of guanine paired with cytosine (G:C) and adenine paired with either thymine (A:T) in the case 
of DNA, or adenine paired with uracil (A:U) in the case of RNA. Inclusion of less common 
bases such as inosine, 5-methylcytosine, 6-methyladenine, hypoxanthine and others in hybridizing 
sequences does not interfere with pairing. 

Targeting double-stranded (ds) DNA with polynucleotides leads to triple-helix formation; 
targeting RNA will lead to double-helix formation. Antisense polynucleotides, when introduced 
into a target cell, specifically bind to their target polynucleotide and interfere with transcription, 
RNA processing, transport, translation and/or stability. Antisense RNA constructs, or DNA 
encoding such antisense RNA's, may be employed to inhibit gene transcription or translation or 
both within a host cell, either in vitro or in vivo, such as within a host animal, including a human 
subject. 

Antisense constructs may be designed to bind to the promoter and other control regions, 
exons, introns or even exon-intron boundaries of a gene. It is contemplated that the most 
effective antisense constructs will include regions complementary to intron/exon splice junctions. 
Thus, it is proposed that a preferred embodiment includes an antisense construct with 
complementarity to regions within 50-200 bases of an intron-exon splice junction. It has been 
observed that some exon sequences can be included in the construct without seriously affecting 
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the target selectivity thereof. The amount of exonic material included will vary depending on the 
particular exon and intron sequences used. One can readily test whether too much exon DNA is 
included simply by testing the constructs in vitro to determine whether normal cellular function 
is affected or whether the expression of related genes having complementary sequences is 
5 affected. 

As stated above, "complementary" or "antisense" means polynucleotide sequences that 
are substantially complementary over their entire length and have very few base mismatches. For 
example, sequences of fifteen bases in length may be termed complementary when they have 

10 complementary nucleotides at thirteen or fourteen positions. Naturally, sequences which are 
completely complementary will be sequences which are entirely complementary throughout their 
entire length and have no base mismatches. Other sequences with lower degrees of homology 
also are contemplated. For example, an antisense construct which has limited regions of high 
homology, but also contains a non-homologous region (e.g., ribozyme; see below) could be 

15 designed. These molecules, though having less than 50% homology, would bind to target 
sequences under appropriate conditions. 

It may be advantageous to combine portions of genomic DNA with cDNA or synthetic 
sequences to generate specific constructs. For example, where an intron is desired in the 
20 ultimate construct, a genomic clone will need to be used. The cDNA or a synthesized 
polynucleotide may provide more convenient restriction sites for the remaining portion of the 
construct and, therefore, would be used for the rest of the sequence. 

£). Ribozymes 

Another approach for addressing the "dominant negative" mutant tumor suppressor is 
25 through the use of ribozymes. Although proteins traditionally have been used for catalysis of 

nucleic acids, another class of macromolecules has emerged as useful in this endeavor. 

Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific fashion. 

Ribozymes have specific catalytic domains that possess endonuclease activity (Kim and Cook, 

1987; Gerlach et aL, 1987; Forster and Symons, 1987). For example, a large number of 
30 ribozymes accelerate phosphoester transfer reactions with a high degree of specificity, often 

cleaving only one of several phosphoesters in an oligonucleotide substrate (Cook et al. 9 1981; 
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Michel and Westhof, 1990; Reinhold-Hurek and Shub, 1992). This specificity has been 
attributed to the requirement that the substrate bind via specific base-pairing interactions to the 
internal guide sequence ("IGS") of the ribozyme prior to chemical reaction. 

5 Ribozyme catalysis has primarily been observed as part of sequence-specific 

cleavage/ligation reactions involving nucleic acids (Joyce, 1989; Cook et aL 9 1981). For 
example, U.S. Patent No. 5,354,855 reports that certain ribozymes can act as endonucleases 
with a sequence specificity greater than that of known ribonucleases and approaching that of the 
DNA restriction enzymes. Thus, sequence-specific ribozyme-mediated inhibition of gene 
10 expression may be particularly suited to therapeutic applications (Scanlon et al 9 1991; Sarver et 
al., 1990). Recently, it was reported that ribozymes elicited genetic changes in some cells lines 
to which they were applied; the altered genes included the oncogenes H-ras, c-fos and genes of 
HIV. Most of this work involved the modification of a target mRNA, based on a specific mutant 
codon that is cleaved by a specific ribozyme. 

15 E. Vectors for Cloning, Gene Transfer and Expression 

Within certain embodiments expression vectors are employed to express the TS10q23.3 
polypeptide product, which can then be purified and, for example, be used to vaccinate animals to 
generate antisera or monoclonal antibody with which further studies may be conducted. In other 
embodiments, the expression vectors are used in gene therapy. Expression requires that appropriate 

20 signals be provided in the vectors, and which include various regulatory elements, such as 
enhancers/promoters from both viral and mammalian sources that drive expression of the genes 
of interest in host cells. Elements designed to optimize messenger RNA stability and 
translatability in host cells also are defined. The conditions for the use of a number of dominant 
drug selection markers for establishing permanent, stable cell clones expressing the products are 

25 also provided, as is an element that links expression of the drug selection markers to expression 
of the polypeptide. 

(i) Regulatory Elements 



30 



Throughout this application, the term "expression construct" is meant to include any type 
of genetic construct containing a nucleic acid coding for a gene product in which part or all of 
the nucleic acid encoding sequence is capable of being transcribed. The transcript may be 
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translated into a protein, but it need not be. In certain embodiments, expression includes both 
transcription of a gene and translation of mRNA into a gene product. In other embodiments, 
expression only includes transcription of the nucleic acid encoding a gene of interest. 

In preferred embodiments, the nucleic acid encoding a gene product is under 
5 transcriptional control of a promoter. A "promoter" refers to a DNA sequence recognized by 
the synthetic machinery of the cell, or introduced synthetic machinery, required to initiate the 
specific transcription of a gene. The phrase "under transcriptional control" means that the 
promoter is in the correct location and orientation in relation to the nucleic acid to control RNA 
polymerase initiation and expression of the gene. 

10 The term promoter will be used here to refer to a group of transcriptional control 

modules that are clustered around the initiation site for RNA polymerase II. Much of the 
thi nkin g about how promoters are organized derives from analyses of several viral promoters, 
including those for the HSV thymidine kinase (tk) and SV40 early transcription units. These 
studies, augmented by more recent work, have shown that promoters are composed of discrete 

15 functional modules, each consisting of approximately 7-20 bp of DNA, and containing one or 
more recognition sites for transcriptional activator or repressor proteins. 

At least one module in each promoter functions to position the start site for RNA 
synthesis. The best known example of this is the TATA box, but in some promoters lacking a 
TATA box, such as the promoter for the mammalian terminal deoxynucleotidyl transferase gene 
20 and the promoter for the SV40 late genes, a discrete element overlying the start site itself helps 
to fix the place of initiation. 

Additional promoter elements regulate the frequency of transcriptional initiation. 
Typically, these are located in the region 30-1 10 bp upstream of the start site, although a number 
of promoters have recently been shown to contain functional elements downstream of the start 
25 site as well. The spacing between promoter elements frequently is flexible, so that promoter 
function is preserved when elements are inverted or moved relative to one another. In the tk 
promoter, the spacing between promoter elements can be increased to 50 bp apart before activity 
begins to decline. Depending on the promoter, it appears that individual elements can function 
either co-operatively or independently to activate transcription. 
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The particular promoter employed to control the expression of a nucleic acid sequence of 
interest is not believed to be important, so long as it is capable of direction the expression of the 
nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is preferable to position 
the nucleic acid coding region adjacent to and under the control of a promoter that is capable of 
being expressed in a human cell. Generally speaking, such a promoter might include either a 
human or viral promoter. 

t 

In various embodiments, the human cytomegalovirus (CMV) immediate early gene 
promoter, the SV40 early promoter, the Rous sarcoma virus long terminal repeat, rat insulin 
promoter and glyceraldehyde-3 -phosphate dehydrogenase can be used to obtain high-level 
expression of the coding sequence of interest. The use of other viral or mammalian cellular or 
bacterial phage promoters which are well-known in the art to achieve expression of a coding 
sequence of interest is contemplated as well, provided that the levels of expression are sufficient 
for a given purpose. 

By employing a promoter with well-known properties, the level and pattern of expression 
of the protein of interest following transfection or transformation can be optimized. Further, 
selection of a promoter that is regulated in response to specific physiologic signals can permit 
inducible expression of the gene product. Tables 2 and 3 list several elements/promoters which 
may be employed, in the context of the present invention, to regulate the expression of the gene 
of interest. This list is not intended to be exhaustive of all the possible elements involved in the 
promotion of gene expression but, merely, to be exemplary thereof. 

Enhancers are genetic elements that increase transcription from a promoter located at a 
distant position on the same molecule of DNA. Enhancers are organized much like promoters. 
That is, they are composed of many individual elements, each of which binds to one or more 
transcriptional proteins. 

The basic distinction between enhancers and promoters is operational. An enhancer 
region as a whole must be able to stimulate transcription at a distance; this need not be true of a 
promoter region or its component elements. On the other hand, a promoter must have one or 
more elements that direct initiation of RNA synthesis at a particular site and in a particular 
orientation, whereas enhancers lack these specificities. Promoters and enhancers are often 
overlapping and contiguous, often seeming to have a very similar modular organization. 
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Below is a list of viral promoters, cellular promoters/enhancers and inducible 
promoters/enhancers that could be used in combination with the nucleic acid encoding a gene of 
interest in an expression construct (Table 2 and Table 3). Additionally, any promoter/enhancer 
combination (as per the Eukaryotic Promoter Data Base EPDB) could also be used to drive 
5 expression of the gene. Eukaryotic cells can support cytoplasmic transcription from certain 
bacterial promoters if the appropriate bacterial polymerase is provided, either as part of the 
delivery complex or as an additional genetic expression construct. 

TABLE 2 

ENHANCER/PROMOTER 

Immunoglobulin Heavy Chain 
Immunoglobulin Light Chain 
T-Cell Receptor 
HLADQaandDQp 
(3-Interferon 
Interleukin-2 
Interleukin-2 Receptor 

MHC Class n 5 

MHC Class II HLA-DRa 

(3-Actin 
Muscle Creatine Kinase 
Prealbumin (Transthyretin) 
Elastase / 
Metallothionein 

Collagenase 
Albumin Gene 
a-Fetoprotein 
x-Globin 
(3-Globin 
e-fos 
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TABLE 2 cont'd 

c-HA-ras 
Insulin 

Neural Cell Adhesion Molecule (NCAM) 
a 1 -Antitrypsin 

H2B (TH2B) Histone 

Mouse or Type I Collagen 
Glucose-Regulated Proteins (GRP94 and GRP78) 
Rat Growth Hormone 
Human Serum Amyloid A (SAA) 

Troponin I (TNI) 

Platelet-Derived Growth Factor 
Duchenne Muscular Dystrophy 

SV40 

Polyoma 

: ■ 

Retroviruses 

Papillomavirus 

Hepatitis B Virus 

Human Immunodeficiency Virus 

Cytomegalovirus 

Gibbon Ape Leukemia Virus 
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XslCllieill 


XIlUULwl 


TV/IT TT 


illxjl UU1 J_>o IC1 111 A 1 

Heavy metals 


MMTV (mouse mammary tumor 
virus) 


Glucocorticoids 


B-Inteiferon 


poly(rI)X 


AHpnnvims 5 E2 


Ela 




Phorbol Ester ( TP H?0-> 


riollaeenase 


Phorbol Ester (TP A) 


Strnmelvsin 


Phorbol Ester (TP A), IL-1 




Phorbol Ester (TPA^ 


Murine MX Gene 


Interferon, Newcastle Disease Virus 


GRP78 Gene 


A23187 


rv Jl JKA a prool nhi lltn 

UL X iViOwl UglUUUllll 


EL-6 


Vimentin 


Serum 


MHC Class I Gene H-2kB 


Interferon 


HSP70 


Ela, S V40 Large T Anttgen 


Proliferin 


Phorbol Ester-TPA 


Tumor Necrosis Factor 


FMA 


Thyroid Stimulating Hormone a 
Gene 


Thyroid Hormone 


Insulin E Box 


Glucose 



Where a cDNA insert is employed, one will typically desire to include a polyadenylation 
5 signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation 
signal is not believed to be crucial to the successful practice of the invention, and any such 
sequence may be employed such as human growth hormone and SV40 polyadenylation signals. 
Also contemplated as an element of the expression cassette is a terminator. These elements can 
serve to enhance message levels and to irdiiimize read through from the cassette into other 
10 sequences. 
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(ii) Selectable Markers 
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In certain embodiments of the invention, the cells contain nucleic acid constructs of the 
present invention, a cell may be identified in vitro or in vivo by including a marker in the 
expression construct. Such markers would confer an identifiable change to the cell permitting 
easy identification of cells containing the expression construct. Usually the inclusion of a drug 
selection marker aids in cloning and in the selection of transformants, for example, genes that 
confer resistance to neomycin, puromycin, hygromycin, DHFR, GPT, zeocin and histidinol are 
useful selectable markers. Alternatively, enzymes such as herpes simplex virus thymidine kinase 
(tk) or chloramphenicol acetyltransferase (CAT) may be employed. Immunologic markers also 
can be employed. The selectable marker employed is not believed to be important, so long as it 
is capable of being expressed simultaneously with the nucleic acid encoding a gene product. 
Further examples of selectable markers are well known to one of skill in the art. 

(iii) Multigene Constructs and IRES 

In certain embodiments of the invention, the use of internal ribosome binding sites 
(IRES) elements are used to create multigene, or polycistronic, messages. IRES elements are 
able to bypass the ribosome scanning model of 5' methylated Cap dependent translation and 
begin translation at internal sites (Pelletier and Sonenberg, 1988). IRES elements from two 
members of the picanovirus family (polio and encephalomyocarditis) have been described 
(Pelletier and Sonenberg, 1988), as well an IRES from a mammalian message (Macejak and 
Sarnow, 1991). IRES elements can be linked to heterologous open reading frames. Multiple 
open reading frames can be transcribed together, each separated by an IRES, creating 
polycistronic messages. By virtue of the IRES element, each open reading frame is accessible to 
ribosomes for efficient translation. Multiple genes can be efficiently expressed using a single 
promoter/enhancer to transcribe a single message. 

Any heterologous open reading frame can be linked to IRES elements. This includes 
genes for secreted proteins, multi-subunit proteins, encoded by independent genes, intracellular 
or membrane-bound proteins and selectable markers. In this way, expression of several proteins 
can be simultaneously engineered into a cell with a single construct and a single selectable 
marker. 
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(iv) Delivery of Expression Vectors 

There are a number of ways in which expression vectors may introduced into cells. In 
certain embodiments of the invention, the expression construct comprises a virus or engineered 
construct derived from a viral genome. The ability of certain viruses to enter cells via receptor- 
mediated endocytosis, to integrate into host cell genome and express viral genes stably and 
efficiently have made them attractive candidates for the transfer of foreign genes into mammalian 
cells (Ridgeway, 1988; Nicolas and Rubenstein, 1988; Baichwal and Sugden, 1986; Temin, 
1986). The first viruses used as gene vectors were DNA viruses including the papovaviruses 
(simian virus 40, bovine papilloma virus, and polyoma) (Ridgeway, 1988; Baichwal and Sugden, 
1986) and adenoviruses (Ridgeway, 1988; Baichwal and Sugden, 1986). These have a relatively 
low capacity for foreign DNA sequences and have a restricted host spectrum. Furthermore, 
their oncogenic potential and cytopathic effects in permissive cells raise safety concerns. They 
can accommodate only up to 8 kb of foreign genetic material but can be readily introduced in a 
variety of cell lines and laboratory animals (Nicolas and Rubenstein, 1988; Temin, 1986). 

One of the preferred methods for in vivo delivery involves the use of an adenovirus 
expression vector. "Adenovirus expression vector" is meant to include those constructs 
containing adenovirus sequences sufficient to (a) support packaging of the construct and (b) to 
express an antisense polynucleotide that has been cloned therein. In this context, expression 
does not require that the gene product be synthesized. 

The expression vector comprises a genetically engineered form of adenovirus. 
Knowledge of the genetic organization of adenovirus, a 36 kb, linear, double-stranded DNA 
virus, allows substitution of large pieces of adenoviral DNA with foreign sequences up to 7 kb 
(Grunhaus and Horwitz, 1992). In contrast to retrovirus, the adenoviral infection of host cells 
does not result in chromosomal integration because adenoviral DNA can replicate in an episomal 
manner without potential genotoxicity. Also, adenoviruses are structurally stable, and no 
genome rearrangement has been detected after extensive amplification. Adenovirus can infect 
virtually all epithelial cells regardless of their cell cycle stage. So far, adenoviral infection 
appears to be linked only to mild disease such as acute respiratory disease in humans. 

Adenovirus is particularly suitable for use as a gene transfer vector because of its mid- 
sized genome, ease of manipulation, high titer, wide target cell range and high infectivity. Both 
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ends of the viral genome contain 100-200 base pair inverted repeats (ITRs), which are cis 
elements necessary for viral DNA replication and packaging. The early (E) and late (L) regions 
of the genome contain different transcription units that are divided by the onset of viral DNA 
replication. The El region (El A and E1B) encodes proteins responsible for the regulation of 
transcription of the viral genome and a few cellular genes. The expression of the E2 region 
(E2A and E2B) results in the synthesis of the proteins for viral DNA replication. These proteins 
are involved in DNA replication, late gene expression and host cell shut-off (Renan, 1990). The 
products of the late genes, including the majority of the viral capsid proteins, are expressed only 
after significant processing of a single primary transcript issued by the major late promoter 
(MLP). The MLP, (located at 16.8 m.u.) is particularly efficient during the late phase of 
infection, and all the mRNA's issued from this promoter possess a 5'-tripartite leader (TPL) 
sequence which makes them preferred mRNA's for translation. 

In a current system, recombinant adenovirus is generated from homologous 
recombination between shuttle vector and provirus vector. Due to the possible recombination 
between two proviral vectors, wild-type adenovirus may be generated from this process. 
Therefore, it is critical to isolate a single clone of virus from an individual plaque and examine its 
genomic structure. 

Generation and propagation of the current adenovirus vectors, which are replication 
deficient, depend on a unique helper cell line, designated 293, which was transformed from 
human embryonic kidney cells by Ad5 DNA fragments and constitutively expresses El proteins 
(Graham et al, 1977). Since the E3 region is dispensable from the adenovirus genome (Jones 
and Shenk, 1978), the current adenovirus vectors, with the help of 293 cells, carry foreign DNA 
in either the El, the D3 or both regions (Graham and Prevec, 1991). In nature, adenovirus can 
package approximately 105% of the wild-type genome (Ghosh-Choudhury et al, 1987), 
providing capacity for about 2 extra kb of DNA. Combined with the approximately 5.5 kb of 
DNA that is replaceable in the El and E3 regions, the maximum capacity of the current 
adenovirus vector is under 7.5 kb, or about 15% of the total length of the vector. More than 
80% of the adenovirus viral genome remains in the vector backbone and is the source of vector- 
borne cytotoxicity. Also, the replication deficiency of the El-deleted virus is incomplete. For 
example, leakage of viral gene expression has been observed with the currently available vectors 
at high multiplicities of infection (MOI) (Mulligan, 1993). 
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Helper cell lines may be derived from human cells such as human embryonic kidney cells, 
muscle cells, hematopoietic cells or other human embryonic mesenchymal or epithelial cells. 
Alternatively, the helper cells may be derived from the cells of other mammalian species that are 
permissive for human adenovirus. Such cells include, e.g., Vero cells or other monkey 
embryonic mesenchymal or epithelial cells. As stated above, the preferred helper cell line is 293 . 

Recently, Racher et aL, (1995) disclosed improved methods for culturing 293 cells and 
propagating adenovirus. In one format, natural cell aggregates are grown by inoculating 
individual cells into 1 liter siliconized spinner flasks (Techne, Cambridge, UK) containing 100- 
200 ml of medium. Following stirring at 40 rpm, the cell viability is estimated with trypan blue. 
In another format, Fibra-Cel microcarriers (Bibby Sterlin, Stone, UK) (5 g/1) is employed as 
follows. A cell inoculum, resuspended in 5 ml of medium, is added to the carrier (50 ml) in a 
250 ml Erlenmeyer flask and left stationary, with occasional agitation, for 1 to 4 h. The medium 
is then replaced with 50 ml of fresh medium and shaking initiated. For virus production, cells are 
allowed to grow to about 80% confluence, after which time the medium is replaced (to 25% of 
the final volume) and adenovirus added at an MOl of 0.05. Cultures are left stationary 
overnight, following which the volume is increased to 100% and shaking commenced for another 
72 h. 

Other than the requirement that the adenovirus vector be replication defective, or at least 
conditionally defective, the nature of the adenovirus vector is not believed to be crucial to the 
successful practice of the invention. The adenovirus may be of any of the 42 different known 
serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material 
in order to obtain the conditional replication-defective adenovirus vector for use in the present 
invention. This is because Adenovirus type 5 is a human adenovirus about which a great deal of 
biochemical and genetic information is known, and it has historically been used for most 
constructions employing adenovirus as a vector. 

As stated above, the typical vector according to the present invention is replication 
defective and will not have an adenovirus El region. Thus, it will be most convenient to 
introduce the polynucleotide encoding the gene of interest at the position from which the El- 
coding sequences have been removed. However, the position of insertion of the construct within 
the adenovirus sequences is not critical to the invention. The polynucleotide encoding the gene 
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of interest may also be inserted in lieu of the deleted E3 region in E3 replacement vectors as 
described by Karlsson et aL, (1986) or in the E4 region where a helper cell line or helper virus 
complements the E4 defect. 

Adenovirus is easy to grow and manipulate and exhibits broad host range in vitro and in 
5 vivo. This group of viruses can be obtained in high titers, e.g., 10 9 -10 u plaque-forming units per 
ml, and they are highly infective. The life cycle of adenovirus does not require integration into 
the host cell genome. The foreign genes delivered by adenovirus vectors are episomal and, 
therefore, have low genotoxicity to host cells. No side effects have been reported in studies of 
vaccination with wild-type adenovirus (Couch et aL, 1963; Top et aL, 1971), demonstrating 
10 their safety and therapeutic potential as in vivo gene transfer vectors. 

Adenovirus vectors have been used in eukaryotic gene expression (Levrero et aL, 1991; 
Gomez-Foix et aL, 1992) and vaccine development (Grunhaus and Horwitz, 1992; Graham and 
Prevec, 1992). Recently, animal studies suggested that recombinant adenovirus could be used 
for gene therapy (Stratford-Perricaudet and Perricaudet, 1991; Stratford-Perricaudet et aL, 
15 1990; Rich et aL, 1993). Studies in administering recombinant adenovirus to different tissues 
include trachea instillation (Rosenfeld et aL, 1991; Rosenfeld et aL, 1992), muscle injection 
(Ragot et aL, 1993), peripheral intravenous injections (Herz and Gerard, 1993) and stereotactic 
inoculation into the brain (Le Gal La Salle et aL, 1993). 

The retroviruses are a group of single-stranded RNA viruses characterized by an ability 
20 to convert their RNA to double-stranded DNA in infected cells by a process of reverse- 
transcription (Coffin, 1990). The resulting DNA then stably integrates into cellular 
chromosomes as a provirus and directs synthesis of viral proteins. The integration results in the 
retention of the viral gene sequences in the recipient cell and its descendants. The retroviral 
genome contains three genes, gag, pol, and env that code for capsid proteins, polymerase 
25 enzyme, and envelope components, respectively. A sequence found upstream from the gag gene 
contains a signal for packaging of the genome into virions. Two long terminal repeat (LTR) 
sequences are present at the 5' and 3' ends of the viral genome. These contain strong promoter 
and enhancer sequences and are also required for integration in the host cell genome (Coffin, 
1990). 
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In order to construct a retroviral vector, a nucleic acid encoding a gene of interest is 
inserted into the viral genome in the place of certain viral sequences to produce a virus that is 
replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, 
and env genes but without the LTR and packaging components is constructed (Mann et aL, 

5 1983). When a recombinant plasmid containing a cDNA, together with the retroviral LTR and 
packaging sequences is introduced into this cell line (by calcium phosphate precipitation for 
example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be 
packaged into viral particles, which are then secreted into the culture media (Nicolas and 
Rubenstein, 1988; Temin, 1986; Mann et aL, 1983). The media containing the recombinant 

10 retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral 
vectors are able to infect a broad variety of cell types. However, integration and stable 
expression require the division of host cells (Paskind et aL, 1975). 

A novel approach designed to allow specific targeting of retrovirus vectors was recently 
developed based on the chemical modification of a retrovirus by the chemical addition of lactose 
15 residues to the viral envelope. This modification could permit the specific infection of 
hepatocytes via sialoglycoprotein receptors. 

A different approach to targeting of recombinant retroviruses was designed in which 
biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor 
were used. The antibodies were coupled via the biotin components by using streptavidin (Roux 
20 et aL, 1989). Using antibodies against major histocompatibility complex class I and class II 
antigens, they demonstrated the infection of a variety of human cells that bore those surface 
antigens with an ecotropic virus in vitro (Roux et aL, 1989). 

There are certain limitations to the use of retrovirus vectors in all aspects of the present 
invention. For example, retrovirus vectors usually integrate into random sites in the cell genome. 

25 This can lead to insertional mutagenesis through the interruption of host genes or through the 
insertion of viral regulatory sequences that can interfere with the function of flanking genes 
(Varmus et aL, 1981). Another concern with the use of defective retrovirus vectors is the 
potential appearance of wild-type replication-competent virus in the packaging cells. This can 
result from recombination events in which the intact- sequence from the recombinant virus 

30 inserts upstream from the gag, pol, env sequence integrated in the host cell genome. However, 
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new packaging cell lines are now available that should greatly decrease the likelihood of 
recombination (Markowitz et al, 1988; Hersdorffer et al, 1990). 

Other viral vectors may be employed as expression constructs in the present invention. 
Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Baichwal and Sugden, 
5 1986; Coupar et al, 1988) adeno-associated virus (AAV) (Ridgeway, 1988; Baichwal and 
Sugden, 1986; Hermonat and Muzycska, 1984) and herpesviruses may be employed. They offer 
several attractive features for various mammalian cells (Friedmann, 1989; Ridgeway, 1988; 
Baichwal and Sugden, 1986; Coupar et al, 1988; Horwich et al t 1990). 

With the recent recognition of defective hepatitis B viruses, new insight was gained into 
10 the structure-function relationship of different viral sequences. In vitro studies showed that the 
virus could retain the ability for helper-dependent packaging and reverse transcription despite the 
deletion of up to 80% of its genome (Horwich et al, 1990). This suggested that large portions 
of the genome could be replaced with foreign genetic material. The hepatotropism and 
persistence (integration) were particularly attractive properties for liver-directed gene transfer. 
15 Chang et al, recently introduced the chloramphenicol acetyltransferase (CAT) gene into duck 
hepatitis B virus genome in the place of the polymerase, surface, and pre-surface coding 
sequences. It was co-transfected with wild-type virus into an avian hepatoma cell line. Culture 
media containing high titers of the recombinant virus were used to infect primary duckling 
hepatocytes. Stable CAT gene expression was detected for at least 24 days after transfection 
20 (Change/ al., 1991). 

In order to effect expression of sense or antisense gene constructs, the expression 
construct must be delivered into a cell. This delivery may be accomplished in vitro, as in 
laboratory procedures for transforming cells lines, or in vivo or ex v/vo, as in the treatment of 
certain disease states. One mechanism for delivery is via viral infection where the expression 
25 construct is encapsidated in an infectious viral particle. 

Several non-viral methods for the transfer of expression constructs into cultured 
mammalian cells also are contemplated by the present invention. These include calcium 
phosphate precipitation (Graham and Van Der Eb, 1973; Chen and Okayama, 1987; Rippe et al., 
1990) DEAE-dextran (Gopal, 1985), electroporation (Tur-Kaspa et al, 1986; Potter et al, 
30 1984), direct microinjection (Harland and Weintraub, 1985), DNA-loaded liposomes (Nicolau 
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and Sene, 1982; Fraley et al, 1979) and lipofectamine-DNA complexes, cell sonication 
(Fechheimer et aL, 1987), gene bombardment using high velocity microprojectiles (Yang et aL, 
1990), and receptor-mediated transfection (Wu and Wu, 1987; Wu and Wu, 1988). Some of 
these techniques may be successfully adapted for in vivo or ex vivo use. 

5 Once the expression construct has been delivered into the cell the nucleic acid encoding 

the gene of interest may be positioned and expressed at different sites. In certain embodiments, 
the nucleic acid encoding the gene may be stably integrated into the genome of the cell. This 
integration may be in the cognate location and orientation via homologous recombination (gene 
replacement) or it may be integrated in a random, non-specific location (gene augmentation). In 

10 yet further embodiments, the nucleic acid may be stably maintained in the cell as a separate, 
episomal segment of DNA. Such nucleic acid segments or "episomes" encode sequences 
sufficient to permit maintenance and replication independent of or in synchronization with the 
host cell cycle. How the expression construct is delivered to a cell and where in the cell the 
nucleic acid remains is dependent on the type of expression construct employed. 

15 In yet another embodiment of the invention, the expression construct may simply consist 

of naked recombinant DNA or plasmids. Transfer of the construct may be performed by any of 
the methods mentioned above which physically or chemically permeabilize the cell membrane. 
This is particularly applicable for transfer in vitro but it may be applied to in vivo use as well. 
Dubensky et aL (1984) successfully injected polyomavirus DNA in the form of calcium 

20 phosphate precipitates into liver and spleen of adult and newborn mice demonstrating active viral 
replication and acute infection. Benvenisty and Neshif (1986) also demonstrated that direct 
intraperitoneal injection of calcium phosphate-precipitated plasmids results in expression of the 
transfected genes. It is envisioned that DNA encoding a gene of interest may also be transferred 
in a similar manner in vivo and express the gene product. 

25 In still another embodiment of the invention for transferring a naked DNA expression 

construct into cells may involve particle bombardment. This method depends on the ability to 
accelerate DNA-coated microprojectiles to a high velocity allowing them to pierce cell 
membranes and enter cells without killing them (Klein et aL, 1987). Several devices for 
accelerating small particles have been developed. One such device relies on a high voltage 

30 discharge to generate an electrical current, which in turn provides the motive force (Y ang et aL, 
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1990). The microprojectiles used have consisted of biologically inert substances such as 
tungsten or gold beads. 

Selected organs including the liver, skin, and muscle tissue of rats and mice have been 
bombarded in vivo (Y ang et al, 1990; Zelenin et al, 1991). This may require surgical exposure 
5 of the tissue or cells, to eliminate any intervening tissue between the gun and the target organ, 
i.e., ex vivo treatment. Again, DNA encoding a particular gene may be delivered via this method 
and still be incorporated by the present invention. 

In a further embodiment of the invention, the expression construct may be entrapped in a 
liposome. Liposomes are vesicular structures characterized by a phospholipid bilayer membrane 
10 and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by 
aqueous medium. They form spontaneously when phospholipids are suspended in an excess of 
aqueous solution. The lipid components undergo self-rearrangement before the formation of 
closed structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and 
Bachhawat, 1991). Also contemplated are lipofectamine-DNA complexes. 

15 Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro has 

been very successful. Wong et al, (1980) demonstrated the feasibility of liposome-mediated 
delivery and expression of foreign DNA in cultured chick embryo, HeLa and hepatoma cells. 
Nicolau et al, (1987) accomplished successful liposome-mediated gene transfer in rats after 
intravenous injection. 

20 J n certain embodiments of the invention, the liposome may be complexed with a 

hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell membrane 
and promote cell entry of liposome-encapsulated DNA (Kaneda et al, 1989). In other 
embodiments, the liposome may be complexed or employed in conjunction with nuclear non- 
histone chromosomal proteins (HMG-1) (Kato et al, 1991). In yet further embodiments, the 

25 liposome may be complexed or employed in conjunction with both HVJ and HMG-1. In that 
such expression constructs have been successfully employed in transfer and expression of nucleic 
acid in vitro and in vivo, then they are applicable for the present invention. Where a bacterial 
promoter is employed in the DNA construct, it also will be desirable to include within the 
liposome an appropriate bacterial polymerase. 
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Other expression constructs which can be employed to deliver a nucleic acid encoding a 
particular gene into cells are receptor-mediated delivery vehicles. These take advantage of the 
selective uptake of macromolecules by receptor-mediated endocytosis in almost all eukaryotic 
cells. Because of the cell type-specific distribution of various receptors, the delivery can be 
highly specific (Wu and Wu, 1993). 

Receptor-mediated gene targeting vehicles generally consist of two components: a cell 
receptor-specific ligand and a DNA-binding agent. Several ligands have been used for receptor- 
mediated gene transfer. The most extensively characterized ligands are asialoorosomucoid 
(ASOR) (Wu and Wu, 1987) and transferrin (Wagner et al., 1990). Recently, a synthetic 
neoglycoprotein, which recognizes the same receptor as ASOR, has been used as a gene delivery 
vehicle (Ferkol et ai t 1993; Perales et al t 1994) and epidermal growth factor (EGF) has also 
been used to deliver genes to squamous carcinoma cells (Myers, EPO 0273085). 

In other embodiments, the delivery vehicle may comprise a ligand and a liposome. For 
example, Nicolau et al. f (1987) employed lactosyl-ceramide, a galactose-terminal 
asialganglioside, incorporated into liposomes and observed an increase in the uptake of the 
insulin gene by hepatocytes. Thus, it is feasible that a nucleic acid encoding a particular gene 
also may be specifically delivered into a cell type such as lung, epithelial or tumor cells, by any 
number of receptor-ligand systems with or without liposomes. For example, epidermal growth 
factor (EGF) may be used as the receptor for mediated delivery of a nucleic acid encoding a 
gene in many tumor cells that exhibit upregulation of EGF receptor. f Mannose can be used to 
target the mannose receptor on liver cells. Also, antibodies to CDS (CLL), CD22 (lymphoma), 
CD25 (T-cell leukemia) and MAA (melanoma) can similarly be used as targeting moieties. 

In certain embodiments, gene transfer may more easily be performed under ex vivo 
conditions. Ex vivo gene therapy refers to the isolation of cells from an animal, the delivery of a 
nucleic acid into the cells in vitro, and then the return of the modified cells back into an animal. 
This may involve the surgical removal of tissue/organs from an animal or the primary culture of 
cells and tissues. 

Primary mammalian cell cultures may be prepared in various ways. In order for the cells to be 
kept viable while in vitro and in contact with the expression construct, it is necessary to ensure that 
the cells maintain contact with the correct ratio of oxygen and carbon dioxide and nutrients but are 

9833907 A 1 I > 



WO 98/33907 PCT/US98/00353 

-48- 

protected from microbial contamination. Cell culture techniques are well documented and are 
disclosed herein by reference (Freshner, 1992). 

One embodiment of the foregoing involves the use of gene transfer to immortalize cells 
5 for the production of proteins. The gene for the protein of interest may be transferred as 
described above into appropriate host cells followed by culture of cells under the appropriate 
conditions. The gene for virtually any polypeptide may be employed in this manner. The 
generation of recombinant expression vectors, and the elements included therein, are discussed 
above. Alternatively, the protein to be produced may be an endogenous protein normally 
10 synthesized by the cell in question. 

Examples of useful mammalian host cell lines are Vero and HeLa cells and cell lines of 
Chinese hamster ovary, W138, BHK, COS-7, 293, HepG2, NIH3T3, RIN and MDCK cells. In 
addition, a host cell strain may be chosen that modulates the expression of the inserted 
15 sequences, or modifies and process the gene product in the manner desired. Such modifications 
{e.g., glycosylation) and processing (e.g., cleavage) of protein products may be important for the 
function of the protein. Different host cells have characteristic and specific mechanisms for the 
post-translational processing and modification of proteins. Appropriate cell lines or host systems 
can be chosen to insure the correct modification and processing of the foreign protein expressed. 

20 

A number of selection systems may be used including, but not limited to, HSV thymidine 
kinase, hypoxanthine-guanine phosphoribosyltransferase and adenine phosphoribosyltransferase 
genes, in tk- y hgprt- or aprt- cells, respectively. Also, anti-metabolite resistance can be used as 
the basis of selection for dhfr, that confers resistance to; gpt 7 that confers resistance to 
25 mycophenolic acid; neo, that confers resistance to the aminoglycoside G418; and hygro, that 
confers resistance to hygromycin. 

Animal cells can be propagated in vitro in two modes: as non-anchorage dependent cells 
growing in suspension throughout the bulk of the culture or as anchorage-dependent cells 
30 requiring attachment to a solid substrate for their propagation (i.e., a monolayer type of cell 
growth). 
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Non-anchorage dependent or suspension cultures from continuous established cell lines 
are the most widely used means of large scale production of cells and cell products. However, 
suspension cultured cells have limitations, such as tumorigenic potential and lower protein 
production than adherent T-cells. 

Large scale suspension culture of mammalian cells in stirred tanks is a common method 
for production of recombinant proteins. Two suspension culture reactor designs are in wide use 
- the stirred reactor and the airlift reactor. The stirred design has successfully been used on an 
8000 liter capacity for the production of interferon. Cells are grown in a stainless steel tank with 
a height-to-diameter ratio of 1 : 1 to 3: 1 . The culture is usually mixed with one or more agitators, 
based on bladed disks or marine propeller patterns. Agitator systems offering less shear forces 
than blades have been described. Agitation may be driven either directly or indirectly by 
magnetically coupled drives. Indirect drives reduce the risk of microbial contamination through 
seals on stirrer shafts. 

The airlift reactor, also initially described for microbial fermentation and later adapted for 
mammalian culture, relies on a gas stream to both mix and oxygenate the culture. The gas 
stream enters a riser section of the reactor and drives circulation. Gas disengages at the culture 
surface, causing denser liquid free of gas bubbles to travel downward in the downcomer section 
of the reactor. The main advantage of this design is the simplicity and lack of need for 
mechanical mixing. Typically, the height-to-diameter ratio is 10:1. The airlift reactor scales up 
relatively easily, has good mass transfer of gases and generates relatively low shear forces. 

The antibodies of the present invention are particularly useful for the isolation of antigens 
by immunoprecipitation. Immunoprecipitation involves the separation of the target antigen 
component from a complex mixture, and is used to discriminate or isolate minute amounts of 
protein. For the isolation of membrane proteins cells must be solubilized into detergent micelles. 
Nonionic salts are preferred, since other agents such as bile salts, precipitate at acid pH or in the 
presence of bivalent cations. Antibodies are and their uses are discussed further, below. 
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In another aspect, the present invention contemplates an antibody that is immunoreactive 
with a TSl0q23.3 molecule of the present invention, or any portion thereof An antibody can be 
a polyclonal or a monoclonal antibody. In a preferred embodiment, an antibody is a monoclonal 
5 antibody. Means for preparing and characterizing antibodies are well known in the art (see, e.g., 
Howell and Lane, 1988). 

Briefly, a polyclonal antibody is prepared by immunizing an animal with an immunogen 
comprising a polypeptide of the present invention and collecting antisera from that immunized 
animal. A wide range of animal species can be used for the production of antisera. Typically an 
10 animal used for production of anti-antisera is a non-human animal including rabbits, mice, rats, 
hamsters, pigs or horses. Because of the relatively large blood volume of rabbits, a rabbit is a 
preferred choice for production of polyclonal antibodies. 

Antibodies, both polyclonal and monoclonal, specific for isoforms of antigen may be 
prepared using conventional immunization techniques, as will be generally known to those of 
15 skill in the art. A composition containing antigenic epitopes of the compounds of the present 
invention can be used to immunize one or more experimental animals, such as a rabbit or mouse, 
which will then proceed to produce specific antibodies against the compounds of the present 
invention. Polyclonal antisera may be obtained, after allowing time for antibody generation, 
simply by bleeding the animal and preparing serum samples from the whole blood. 

20 It is proposed that the monoclonal antibodies of the present invention will find useful 

application in standard immunochemical procedures, such as ELISA and Western blot methods 
and in immunohistochemical procedures such as tissue staining, as well as in other procedures 
which may utilize antibodies specific to TS10q23.3-related antigen epitopes. Additionally, it is 
proposed that monoclonal antibodies specific to the particular TS10q23.3 of different species 

25 may be utilized in other useful applications 

In general, both polyclonal and monoclonal antibodies against TS10q23.3 may be used in 
a variety of embodiments. For example, they may be employed in antibody cloning protocols to 
obtain cDNAs or genes encoding other TS10q23.3. They may also be used in inhibition studies 
to analyze the effects of TS10q23.3 related peptides in cells or animals. Anti-TS10q23.3 
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antibodies will also be useful in immunolocalization studies to analyze the distribution of 
TS10q23.3 during various cellular events, for example, to determine the cellular or tissue- 
specific distribution of TS10q23.3 polypeptides under different points in the cell cycle. A 
particularly useful application of such antibodies is in purifying native or recombinant 
TS10q23.3, for example, using an antibody affinity column. The operation of all such 
immunological techniques will be known to those of skill in the art in light of the present 
disclosure. 

Means for preparing and characterizing antibodies are well known in the art (see, e.g., 
Harlow and Lane, 1988; incorporated herein by reference). More specific examples of 
monoclonal antibody preparation are give in the examples below. 

As is well known in the art, a given composition may vary in its immunogenicity. It is 
often necessary therefore to boost the host immune system, as may be achieved by coupling a 
peptide or polypeptide immunogen to a carrier. Exemplary and preferred carriers are keyhole 
limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins such as ovalbumin, 
mouse serum albumin or rabbit serum albumin can also be used as carriers. Means for 
conjugating a polypeptide to a carrier protein are well known in the art and include 
glutaraldehyde, /w-maleimidobencoyl-N-hydroxysuccinimide ester, carbodiimide and bis- 
biazotized benzidine. 

As also is well known in the art, the immunogenicity of a particular immunogen 
composition can be enhanced by the use of non-specific stimulators of the immune response, 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant (a 
non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis), 
incomplete Freund's adjuvants and aluminum hydroxide adjuvant. 

The amount of immunogen composition used in the production of polyclonal antibodies 
varies upon the nature of the immunogen as well as the animal used for immunization. A variety 
of routes can be used to administer the immunogen (subcutaneous, intramuscular, intradermal, 
intravenous and intraperitoneal). The production of polyclonal antibodies may be monitored by 
sampling blood of the immunized animal at various points following immunization. A second, 
booster, injection may also be given. The process of boosting and titering is repeated until a 
suitable titer is achieved. When a desired level of immunogenicity is obtained, the immunized 
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animal can be bled and the serum isolated and stored, and/or the animal can be used to generate 
mAbs. 

MAbs may be readily prepared through use of well-known techniques, such as those 
exemplified in U.S. Patent 4,196,265, incorporated herein by reference. Typically, this technique 
involves immunizing a suitable animal with a selected immunogen composition, e.g., a purified or 
partially purified TS10q23.3 protein, polypeptide or peptide or cell expressing high levels of 
TS10q23.3. The immunizing composition is administered in a manner effective to stimulate 
antibody producing cells. Rodents such as mice and rats are preferred animals, however, the use 
of rabbit, sheep frog cells is also possible. The use of rats may provide certain advantages 
(Goding, 1986), but mice are preferred, with the BALB/c mouse being most preferred as this is 
most routinely used and generally gives a higher percentage of stable fusions. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B-lymphocytes (B-cells), are selected for use in the mAb generating protocol. These 
cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a peripheral blood 
sample. Spleen cells and peripheral blood cells are preferred, the former because they are a rich 
source of antibody-producing cells that are in the dividing plasmablast stage, and the latter 
because peripheral blood is easily accessible. Often, a panel of animals will have been immunized 
and the spleen of animal with the highest antibody titer will be removed and the spleen 
lymphocytes obtained by homogenizing the spleen with a syringe. Typically, a spleen from an 
immunized mouse contains approximately 5 x 10 7 to 2 x 10 8 lymphocytes. 

The antibody-producing B lymphocytes from the immunized animal are then fused with 
cells of an immortal myeloma cell, generally one of the same species as the animal that was 
immunized. Myeloma cell lines suited for use in hybridoma-producing fusion procedures 
preferably are non-antibody-producing, have high fusion efficiency, and enzyme deficiencies that 
render then incapable of growing in certain selective media which support the growth of only the 
desired fused cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of skill in the 
art (Goding, 1986; Campbell, 1984). For example, where the immunized animal is a mouse, one 
may use P3-X63/Ag8, P3-X63-Ag8.653, NSl/l.Ag 4 1, Sp210-Agl4, FO, NSO/U, MPC-11, 
MPC1 1-X45-GTG 1.7 and S194/5XX0 Bui; for rats, one may use R210.RCY3, Y3-Ag 1.2.3, 
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IR983F and 4B210; and U-266, GM1500-GRG2, LICR-LON-HMy2 and UC729-6 are all useful 
in connection with cell fusions. 

Methods for generating hybrids of antibody-producing spleen or lymph node cells and 
myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 ratio, though 
the ratio may vary from about 20:1 to about 1:1, respectively, in the presence of an agent or 
agents (chemical or electrical) that promote the fusion of cell membranes. Fusion methods using 
Sendai virus have been described (Kohler and Milstein, 1975; 1976), and those using 
polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter et al y (1977). The use of 
electrically induced fusion methods is also appropriate (Goding, 1986). 

Fusion procedures usually produce viable hybrids at low frequencies, around 1 x 10" 6 to 
1 x 10~ 8 , However, this does not pose a problem, as the viable, fused hybrids are differentiated 
from the parental, unfused cells (particularly the unfused myeloma cells that would normally 
continue to divide indefinitely) by culturing in a selective medium. The selective medium is 
generally one that contains an agent that blocks the de novo synthesis of nucleotides in the tissue 
culture media. Exemplary and preferred agents are aminopterin, methotrexate, and azaserine. 
Aminopterin and methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the media is 
supplemented with hypoxanthine and thymidine as a source of nucleotides (HAT medium). 
Where azaserine is used, the media is supplemented with hypoxanthine. 

The preferred selection medium is HAT. Only cells capable of operating nucleotide 
salvage pathways are able to survive in HAT medium. The myeloma cells are defective in key 
enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), and 
they cannot survive. The B-cells can operate this pathway, but they have a limited life span in 
culture and generally die within about two weeks. Therefore, the only cells that can survive in 
the selective media are those hybrids formed from myeloma and B-cells. 

This culturing provides a population of hybridomas from which specific hybridomas are 
selected. Typically, selection of hybridomas is performed by culturing the cells by single-clone 
dilution in microtiter plates, followed by testing the individual clonal supernatants (after about 
two to three weeks) for the desired reactivity. The assay should be sensitive, simple and rapid, 
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such as radioimmunoassays, enzyme immunoassays, cytotoxicity assays, plaque assays, dot 
immunobinding assays, and the like. 

The selected hybridomas would then be serially diluted and cloned into individual 
antibody-producing cell lines, which clones can then be propagated indefinitely to provide mAbs. 
The cell lines may be exploited for mAb production in two basic ways. A sample of the 
hybridoma can be injected (often into the peritoneal cavity) into a histocompatible animal of the 
type that was used to provide the somatic and myeloma cells for the original fusion. The 
injected animal develops tumors secreting the specific monoclonal antibody produced by the 
fused cell hybrid. The body fluids of the animal, such as serum or ascites fluid, can then be 
tapped to provide mAbs in high concentration. The individual cell lines could also be cultured in 
vitro, where the mAbs are naturally secreted into the culture medium from which they can be 
readily obtained in high concentrations. mAbs produced by either means may be further purified, 
if desired, using filtration, centrifugation and various chromatographic methods such as HPLC or 
affinity chromatography. 

IV. Diagnosing Cancers Involving TS10q23.3 

The present inventors have determined that alterations in TS10q23.3 are associated with 
malignancy. Therefore, TS10q23.3 and the corresponding gene may be employed as a 
diagnostic or prognostic indicator of cancer. More specifically, point mutations, deletions, 
insertions or regulatory pertubations relating to TS10q23.3 may cause cancer or promote cancer 
development, cause or promoter tumor progression at a primary site, and/or cause or promote 
metastasis. Other phenomena associated with malignancy that may be affected by TS10q23.3 
expression include angiogenesis and tissue invasion. 

A. Genetic Diagnosis 

One embodiment of the instant invention comprises a method for detecting variation in 
the expression of TS10q23.3. This may comprises determining that level of TS10q23.3 or 
determining specific alterations in the expressed product. Obviously, this sort of assay has 
importance in the diagnosis of related cancers. Such cancer may involve cancers of the brain 
(glioblastomas, medulloblastoma, astrocytoma, oligodendroglioma, ependymomas), lung, liver, 
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spleen, kidney^ pancreas, small intestine, blood cells, lymph node, colon, breast, endometrium, 
stomach, prostate, testicle, ovary, skin, head and neck, esophagus, bone marrow, blood or other 
tissue. In particular, the present invention relates to the diagnosis of gliomas. 

The biological sample can be any tissue or fluid. Various embodiments include cells of 
5 the skin, muscle, facia, brain, prostate, breast, endometrium, lung, head & neck, pancreas, small 
intestine, blood cells, liver, testes, ovaries, colon, skin, stomach, esophagus, spleen, lymph node, 
bone marrow or kidney. Other embodiments include fluid samples such as peripheral blood, 
lymph fluid, ascites, serous fluid, pleural effusion, sputum, cerebrospinal fluid, lacrimal fluid, 
stool or urine. 

10 Nucleic acid used is isolated from cells contained in the biological sample, according to 

standard methodologies (Sambrook et aL y 1989). The nucleic acid may be genomic DNA or 
fractionated or whole cell RNA. Where RNA is used, it may be desired to convert the RNA to a 
complementary DNA. In one embodiment, the RNA is whole cell RNA; in another, it is poly-A 
RNA. Normally, the nucleic acid is amplified. 

1 5 Depending on the format, the specific nucleic acid of interest is identified in the sample 

directly using amplification or with a second, known nucleic acid following amplification. Next, 
the identified product is detected. In certain applications, the detection may be performed by 
visual means (e.g., ethidium bromide staining of a gel). Alternatively, the detection may involve 
indirect identification of the product via chemiluminescence, radioactive scintigraphy of 

20 radiolabel or fluorescent label or even via a system using electrical or thermal impulse signals 
(Affymax Technology; Bellus, 1994). 

Following detection, one may compare the results seen in a given patient with a 
statistically significant reference group of normal patients and patients that have TS10q23.3- 
related pathologies. In this way, it is possible to correlate the amount or kind of TS10q23.3 
25 detected with various clinical states. 

Various types of defects are to be identified. Thus, "alterations" should be read as 
including deletions, insertions, point mutations and duplications. Point mutations result in stop 
codons, frameshift mutations or amino acid substitutions. Somatic mutations are those 
occurring in non-gennline tissues. Germ-line tissue can occur in any tissue and are inherited. 
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Mutations in and outside the coding region also may affect the amount of TS10q23.3 produced, 
both by altering the transcription of the gene or in destabilizing or otherwise altering the 
processing of either the transcript (mRNA) or protein. 

A variety of different assays are contemplated in this regard, including but not limited to, 
5 fluorescent in situ hybridization (FISH), direct DNA sequencing, PFGE analysis, Southern or 
Northern blotting, single-stranded conformation analysis (SSCA), RNAse protection assay, 
allele-specific oligonucleotide (ASO), dot blot analysis, denaturing gradient gel electrophoresis, 
RFLP and PCR-SSCP. 

(i) Primers and Probes 

10 The term primer, as defined herein, is meant to encompass any nucleic acid that is 

capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. 
Typically, primers are oligonucleotides from ten to twenty base pairs in length, but longer 
sequences can be employed. Primers may be provided in double-stranded or single-stranded 
form, although the single-stranded form is preferred. Probes are defined differently, although 

15 they may act as primers. Probes, while perhaps capable of priming, are designed to binding to 
the target DNA or RNA and need not be used in an amplification process. 

In preferred embodiments, the probes or primers are labeled with radioactive species 

32> 14 33 3 

( P, C, S, H, or other label), with a fluorophore (rhodamine, fluorescein) or a 
chemillumiscent (luciferase). 

20 (ii) Template Dependent Amplification Methods 

A number of template dependent processes are available to amplify the marker sequences 
present in a given template sample. One of the best known amplification methods is the 
polymerase chain reaction (referred to as PCR™) which is described in detail in U.S. Patent Nos. 
4,683,195, 4,683,202 and 4,800,159, and in Innis et al., 1990, each of which is incorporated 
25 herein by reference in its entirety. 

Briefly, in PCR, two primer sequences are prepared that are complementary to regions 
on opposite complementary strands of the marker sequence. An excess of deoxynucleoside 
triphosphates are added to a reaction mixture along with a DNA polymerase, e.g., Tag 
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polymerase. If the marker sequence is present in a sample, the primers will bind to the marker 
and the polymerase will cause the primers to be extended along the marker sequence by adding 
on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended 
primers will dissociate from the marker to form reaction products, excess primers will bind to the 
marker and to the reaction products and the process is repeated. 

A reverse transcriptase PCR amplification procedure may be performed in order to 
quantify the amount of mRNA amplified. Methods of reverse transcribing KNA into cDNA are 
well known and described in Sambrook ei al, 1989. Alternative methods for reverse 
transcription utilize thermostable, RNA-dependent DNA polymerases. These methods are 
described in WO 90/07641 filed December 21, 1990. Polymerase chain reaction methodologies 
are well known in the art. 

Another method for amplification is the ligase chain reaction ("LCR"), disclosed in EPO 
No. 320 308, incorporated herein by reference in its entirety. In LCR, two complementary 
probe pairs are prepared, and in the presence of the target sequence, each pair will bind to 
opposite complementary strands of the target such that they abut. In the presence of a ligase, 
the two probe pairs will link to form a single unit. By temperature cycling, as in PCR, bound 
ligated units dissociate from the target and then serve as "target sequences" for ligation of excess 
probe pairs. U.S. Patent 4,883,750 describes a method similar to LCR for binding probe pairs to 
a target sequence. 

Qbeta Replicase, described in PCT Application No. PCTAJS87/00880, may also be used 
as still another amplification method in the present invention. In this method, a replicative 
sequence of RNA that has a region complementary to that of a target is added to a sample in the 
presence of an RNA polymerase. The polymerase will copy the replicative sequence that can 
then be detected. 

An isothermal amplification method, in which restriction endonucleases and ligases are 
used to achieve the amplification of target molecules that contain nucleotide 5'-[alpha-thio]- 
triphosphates in one strand of a restriction site may also be useful in the amplification of nucleic 
acids in the present invention, Walker et al y (1992). 
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Strand Displacement Amplification (SDA) is another method of carrying out isothermal 
amplification of nucleic acids which involves multiple rounds of strand displacement and 
synthesis, /.e., nick translation. A similar method, called Repair Chain Reaction (RCR), involves 
annealing several probes throughout a region targeted for amplification, followed by a repair 
reaction in which only two of the four bases are present. The other two bases can be added as 
biotinylated derivatives for easy detection. A similar approach is used in SDA. Target specific 
sequences can also be detected using a cyclic probe reaction (CPR). In CPR, a probe having 3' 
and 5' sequences of non-specific DNA and a middle sequence of specific RNA is hybridized to 
DNA that is present in a sample. Upon hybridization, the reaction is treated with RNase H, and 
the products of the probe identified as distinctive products that are released after digestion. The 
original template is annealed to another cycling probe and the reaction is repeated. 

Still another amplification methods described in GB Application No. 2 202 328, and in 
PCT Application No. PCT/US89/01025, each of which is incorporated herein by reference in its 
entirety, may be used in accordance with the present invention. In the former application, 
"modified" primers are used in a PCR-Iike, template- and enzyme-dependent synthesis. The 
primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector moiety 
(e.g., enzyme). In the latter application, an excess of labeled probes are added to a sample. In 
the presence of the target sequence, the probe binds and is cleaved catalytically. After cleavage, 
the target sequence is released intact to be bound by excess probe. Cleavage of the labeled 
probe signals the presence of the target sequence. 

Other nucleic acid amplification procedures include transcription-based amplification 
systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR (Kwoh 
et al, 1989; Gingeras et al, PCT Application WO 88/103 15, incorporated herein by reference in 
their entirety). In NASBA, the nucleic acids can be prepared for amplification by standard 
phenol/chloroform extraction, heat denaturation of a clinical sample, treatment with lysis buffer 
and minispin columns for isolation of DNA and RNA or guanidinium chloride extraction of 
RNA. These amplification techniques involve annealing a primer which has target specific 
sequences. Following polymerization, DNA/RNA hybrids are digested with RNase H while 
double stranded DNA molecules are heat denatured again. In either case the single stranded 
DNA is made fully double stranded by addition of second target specific primer, followed by 
polymerization. The double-stranded DNA molecules are then multiply transcribed by an RNA 
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polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNA's are reverse 
transcribed into single stranded DNA, which is then converted to double stranded DNA, and 
then transcribed once again with an RNA polymerase such as T7 or SP6. The resulting 
products, whether truncated or complete, indicate target specific sequences. 

5 Davey et al. y EPO No. 329 822 (incorporated herein by reference in its entirety) disclose 

a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with 
the present invention. The ssRNA is a template for a first primer oligonucleotide, which is 
elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then 

10 removed from the resulting DNA:RNA duplex by the action of ribonuclease H (RNase H, an 
RNase specific for RNA in duplex with either DNA or RNA). The resultant ssDNA is a 
template for a second primer, which also includes the sequences of an RNA polymerase 
promoter (exemplified by T7 RNA polymerase) 5' to its homology to the template. This primer 
is then extended by DNA polymerase (exemplified by the large "Klenow" fragment of E. coli 

15 DNA polymerase I), resulting in a double-stranded DNA ("dsDNA") molecule, having a 
sequence identical to that of the original RNA between the primers and having additionally, at 
one end, a promoter sequence. This promoter sequence can be used by the appropriate RNA 
polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle 
leading to very swift amplification. With proper choice of enzymes, this amplification can be 

20 done isothermally without addition of enzymes at each cycle. Because of the cyclical nature of 
this process, the starting sequence can be chosen to be in the form of either DNA or RNA. 

Miller et aL, PCT Application WO 89/06700 (incorporated herein by reference in its 
entirety) disclose a nucleic acid sequence amplification scheme based on the hybridization of a 
promoter/primer sequence to a target single-stranded DNA ("ssDNA") followed by transcription 
25 of many RNA copies of the sequence. This scheme is not cyclic, i.e., new templates are not 
produced from the resultant RNA transcripts. Other amplification methods include "RACE" and 
"one-sided PCR" (Frohman, MA., In: PCR PROTOCOLS: A GUIDE TO METHODS AND 
APPLICATIONS, Academic Press, N.Y., 1990; Ohara et a/., 1989; each herein incorporated by 
reference in their entirety). 
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Methods based on ligation of two (or more) oligonucleotides in the presence of nucleic 
acid having the sequence of the resulting "di-oligonucleotide", thereby amplifying the di- 
oligonucleotide, may also be used in the amplification step of the present invention. Wu et al, 
(1989), incorporated herein by reference in its entirety. 

(Hi) Southern/Northern Blotting 

Blotting techniques are well known to those of skill in the art. Southern blotting involves 
the use of DNA as a target, whereas Northern blotting involves the use of RNA as a target. 
Each provide different types of information, although cDNA blotting is analogous, in many 
aspects, to blotting or RNA species. 

Briefly, a probe is used to target a DNA or RNA species that has been immobilized on a 
suitable matrix, often a filter of nitrocellulose. The different species should be spatially separated 
to facilitate analysis. This often is accomplished by gel electrophoresis of nucleic acid species 
followed by "blotting" on to the filter. 

Subsequently, the blotted target is incubated with a probe (usually labeled) under 
conditions that promote denaturation and rehybridization. Because the probe is designed to base 
pair with the target, the probe will binding a portion of the target sequence under renaturing 
conditions. Unbound probe is then removed, and detection is accomplished as described above. 

(iv) Separation Methods 

It normally is desirable, at one stage or another, to separate the amplification product 
from the template and the excess primer for the purpose of determining whether specific 
amplification has occurred. In one embodiment, amplification products are separated by 
agarose, agarose-acrylamide or polyacrylamide gel electrophoresis using standard methods. See 
Sambrookef a/., 1989. 

Alternatively, chromatographic techniques may be employed to effect separation. There 
are many kinds of chromatography which may be used in the present invention: adsorption, 
partition, ion-exchange and molecular sieve, and many specialized techniques for using them 
including column, paper, thin-layer and gas chromatography (Freifelder, 1982). 
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(v) Detection Methods 

Products may be visualized in order to confirm amplification of the marker sequences. 
One typical visualization method involves staining of a gel with ethidium bromide and 
visualization under UV light. Alternatively, if the amplification products are integrally labeled 
5 with radio- or fluorometrically-labeled nucleotides, the amplification products can then be 
exposed to x-ray film or visualized under the appropriate stimulating spectra, following 
separation. 

In one embodiment, visualization is achieved indirectly. Following separation of 
amplification products, a labeled nucleic acid probe is brought into contact with the amplified 
10 marker sequence. The probe preferably is conjugated to a chromophore but may be 
radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such as an 
antibody or biotin, and the other member of the binding pair carries a detectable moiety. 

In one embodiment, detection is by a labeled probe. The techniques involved are well 
known to those of skill in the art and can be found in many standard books on molecular 
15 protocols. See Sambrook et al. y 1989. For example, chromophore or radiolabel probes or 
primers identify the target during or following amplification. 

One example of the foregoing is described in U.S. Patent No. 5,279,721, incorporated by 
reference herein, which discloses an apparatus and method for the automated electrophoresis and 
transfer of nucleic acids. The apparatus permits electrophoresis and blotting without external 
20 manipulation of the gel and is ideally suited to carrying out methods according to the present 
invention. 

In addition, the amplification products described above may be subjected to sequence 
analysis to identify specific kinds of variations using standard sequence analysis techniques. 
Within certain methods, exhaustive analysis of genes is carried out by sequence analysis using 
25 primer sets designed for optimal sequencing (Pignon et al, 1994). The present invention 
provides methods by which any or all of these types of analyses may be used. Using the 
sequences disclosed herein, oligonucleotide primers may be designed to permit the amplification 
of sequences throughout the TS10q23,3 gene that may then be analyzed by direct sequencing. 
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(vi) Kit Components 

All the essential materials and reagents required for detecting and sequencing TS10q23.3 
and variants thereof may be assembled together in a kit. This generally will comprise preselected 
primers and probes. Also included may be enzymes suitable for amplifying nucleic acids 
5 including various polymerases (RT, Taq, Sequenase™ etc.), deoxynucleotides and buffers to 
provide the necessary reaction mixture for amplification. Such kits also generally will comprise, 
in suitable means, distinct containers for each individual reagent and enzyme as well as for each 
primer or probe. 

(vii) Design and Theoretical Considerations for Relative Quantitative 
10 RT-PCR 

Reverse transcription (RT) of RNA to cDNA followed by relative quantitative PCR 
(RT-PCR) can be used to determine the relative concentrations of specific mRNA species 
isolated from patients. By determining that the concentration of a specific mRNA species varies, 
15 it is shown that the gene encoding the specific mRNA species is differentially expressed. 

In PCR, the number of molecules of the amplified target DNA increase by a factor 
approaching two with every cycle of the reaction until some reagent becomes limiting. 
Thereafter, the rate of amplification becomes increasingly diminished until there is no increase in 
the amplified target between cycles. If a graph is plotted in which the cycle number is on the X 

20 axis and the log of the concentration of the amplified target DNA is on the Y axis, a curved line 
of characteristic shape is formed by connecting the plotted points. Beginning with the first cycle, 
the slope of the line is positive and constant. This is said to be the linear portion of the curve. 
After a reagent becomes limiting, the slope of the line begins to decrease and eventually becomes 
zero. At this point the concentration of the amplified target DNA becomes asymptotic to some 

25 fixed value. This is said to be the plateau portion of the curve. 

The concentration of the target DNA in the linear portion of the PCR amplification is 
directly proportional to the starting concentration of the target before the reaction began. By 
determining the concentration of the amplified products of the target DNA in PCR reactions that 
have completed the same number of cycles and are in their linear ranges, it is possible to 
30 determine the relative concentrations of the specific target sequence in the original DNA 
mixture. If the DNA mixtures are cDNAs synthesized from RNAs isolated from different tissues 
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or cells, the relative abundances of the specific mRNA from which the target sequence was 
derived can be determined for the respective tissues or cells. This direct proportionality between 
the concentration of the PCR products and the relative mRNA abundances is only true in the 
linear range of the PCR reaction. 

5 The final concentration of the target DNA in the plateau portion of the curve is 

determined by the availability of reagents in the reaction mix and is independent of the original 
concentration of target DNA. Therefore, the first condition that must be met before the relative 
abundances of a mRNA species can be determined by RT-PCR for a collection of RNA 
populations is that the concentrations of the amplified PCR products must be sampled when the 

10 PCR reactions are in the linear portion of their curves. 

The second condition that must be met for an RT-PCR experiment to successfully 
determine the relative abundances of a particular mRNA species is that relative concentrations of 
the amplifiable cDNAs must be normalized to some independent standard. The goal of an RT- 
PCR experiment is to determine the abundance of a particular mRNA species relative to the 
15 average abundance of all mRNA species in the sample. In the experiments described below, 
mRNAs for B-actin, asparagine synthetase and lipocortin II v/ere used as external and internal 
standards to which the relative abundance of other mRNAs are compared. 

Most protocols for competitive PCR utilize internal PCR standards that are 
approximately as abundant as the target. These strategies are effective if the products of the PCR 

20 amplifications are sampled during their linear phases. If the products are sampled when the 
reactions are approaching the plateau phase, then the less abundant product becomes relatively 
over represented. Comparisons of relative abundances made for many different RNA samples, 
such as is the case when examining RNA samples for differential expression, become distorted in 
such a way as to make differences in relative abundances of RNAs appear less than they actually 

25 are. This is not a significant problem if the internal standard is much more abundant than the 
target. If the internal standard is more abundant than the target, then direct linear comparisons 
can be made between RNA samples. 

The above discussion describes theoretical considerations for an RT-PCR assay for 
clinically derived materials. The problems inherent in clinical samples are that they are of 
variable quantity (making normalization problematic), and that they are of variable quality 
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(necessitating the co-amplification of a reliable internal control, preferably of larger size than the 
target). Both of these problems are overcome if the RT-PCR is performed as a relative 
quantitative RT-PCR with an internal standard in which the internal standard is an amplifiable 
cDNA fragment that is larger than the target cDNA fragment and in which the abundance of the 
mRNA encoding the internal standard is roughly 5-100 fold higher than the mRNA encoding the 
target. This assay measures relative abundance, not absolute abundance of the respective mRNA 
species. 

Other studies may be performed using a more conventional relative quantitative RT-PCR 
assay with an external standard protocol. These assays sample the PCR products in the linear 
portion of their amplification curves. The number of PCR cycles that are optimal for sampling 
must be empirically determined for each target cDNA fragment. In addition, the reverse 
transcriptase products of each RNA population isolated from the various tissue samples must be 
carefully normalized for equal concentrations of amplifiable cDNAs. This consideration is very 
important since the assay measures absolute mRNA abundance. Absolute mRNA abundance can 
be used as a measure of differential gene expression only in normalized samples. While empirical 
determination of the linear range of the amplification curve and normalization of cDNA 
preparations are tedious and time consuming processes, the resulting RT-PCR assays can be 
superior to those derived from the relative quantitative RT-PCR assay with an internal standard. 

One reason for this advantage is that without the internal standard/competitor, all of the 
reagents can be converted into a single PCR product in the linear range of the amplification 
curve, thus increasing the sensitivity of the assay. Another reason is that with only one PCR 
product, display of the product on an electrophoretic gel or another display method becomes less 
complex, has less background and is easier to interpret. 

(viii) Chip Technologies 

Specifically contemplated by the present inventors are chip-based DNA technologies 
such as those described by Hacia et al (1996) and Shoemaker et al (1996). Briefly, these 
techniques involve quantitative methods for analyzing large numbers of genes rapidly and 
accurately. By tagging genes with oligonucleotides or using fixed probe arrays, one can employ 
chip technology to segregate target molecules as high density arrays and screen these molecules 
on the basis of hybridization. See also Pease et al (1994); Fodor et al (1991). 



WO 98/33907 



-65- 



PCT/US98/00353 



B. Immunodiagnosis 

Antibodies of the present invention can be used in characterizing the TS10q23.3 content 
of healthy and diseased tissues, through techniques such as ELIS As and Western blotting. This 
may provide a screen for the presence or absence of malignancy or as a predictor of future 
5 cancer. 

The use of antibodies of the present invention, in an ELISA assay is contemplated. For 
example, anti-TS10q23.3 antibodies are immobilized onto a selected surface, preferably a surface 
exhibiting a protein affinity such as the wells of a polystyrene microtiter plate. After washing to 
remove incompletely adsorbed material, it is desirable to bind or coat the assay plate wells with a 
10 non-specific protein that is known to be antigenically neutral with regard to the test antisera such 
as bovine serum albumin (BSA), casein or solutions of powdered milk. This allows for blocking 
of non-specific adsorption sites on the immobilizing surface and thus reduces the background 
caused by non-specific binding of antigen onto the surface. 

After binding of antibody to the well, coating with a non-reactive material to reduce 
15 background, and washing to remove unbound material, the immobilizing surface is contacted 
with the sample to be tested in a manner conducive to immune complex (antigen/antibody) 
formation. 

Following formation of specific immunocomplexes between the test sample and the 
bound antibody, and subsequent washing, the occurrence and even amount of immunocomplex 

20 formation may be determined by subjecting same to a second antibody having specificity for 
TS10q23.3 that differs the first antibody. Appropriate conditions preferably include diluting the 
sample with diluents such as BSA, bovine gamma globulin (BGG) and phosphate buffered saline 
(PBS)/Tween <R> . These added agents also tend to assist in the reduction of nonspecific 
background. The layered antisera is then allowed to incubate for from about 2 to about 4 hr, at 

25 temperatures preferably on the order of about 25° to about 27°C. Following incubation, the 
antisera-contacted surface is washed so as to remove non-immunocomplexed material. A 
preferred washing procedure includes washing with a solution such as PBS/Tween®, or borate 
buffer. 
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To provide a detecting means, the second antibody will preferably have an associated 
enzyme that will generate a color development upon incubating with an appropriate chromogenic 
substrate. Thus, for example, one will desire to contact and incubate the second antibody-bound 
surface with a urease or peroxidase-conjugated anti-human IgG for a period of time and under 
conditions which favor the development of immunocomplex formation (e.g., incubation for 2 hr 
at room temperature in a PBS-containing solution such as PBS/Tween®). 

After incubation with the second enzyme-tagged antibody, and subsequent to washing to 
remove unbound material, the amount of label is quantified by incubation with a chromogenic 
substrate such as urea and bromocresol purple or 2,2'-azino-di-(3-ethyl-benzthiazoline)-6- 
sulfonic acid (ABTS) and H 2 0 2 , in the case of peroxidase as the enzyme label. Quantitation is 
then achieved by measuring the degree of color generation, e.g., using a visible spectrum 
spectrophotometer. 

The preceding format may be altered by first binding the sample to the assay plate. Then, 
primary antibody is incubated with the assay plate, followed by detecting of bound primary 
antibody using a labeled second antibody with specificity for the primary antibody. 

The antibody compositions of the present invention will find great use in immunoblot or 
Western blot analysis. The antibodies may be used as high-affinity primary reagents for the 
identification of proteins immobilized onto a solid support matrix, such as nitrocellulose, nylon 
or combinations thereof. In conjunction with immunoprecipitation, followed by gel 
electrophoresis, these may be used as a single step reagent for use in detecting antigens against 
which secondary reagents used in the detection of the antigen cause an adverse background. 
Immunologically-based detection methods for use in conjunction with Western blotting include 
enzymatically-, radiolabel-, or fluorescently-tagged secondary antibodies against the toxin moiety 
are considered to be of particular use in this regard. 

V. Methods for Screening Active Compounds 

The present invention also contemplates the use of TS10q23.3 and active fragments, and 
nucleic acids coding therefor, in the screening of compounds for activity in either stimulating 
TS10q23.3 activity, overcoming the lack of TS10q23.3 or blocking the effect of a mutant 
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TS10q23.3 molecule. These assays may make use of a variety of different formats and may 
depend on the kind of "activity" for which the screen is being conducted. Contemplated 
functional "read-outs" include binding to a compound, inhibition of binding to a substrate, 
ligand, receptor or other binding partner by a compound, phosphatase activity, anti-phosphatase 
5 activity, phosphorylation of TS10q23.3, dephosphorylation of TS10q23.3, inhibition or 
stimulation of cell-to-cell signaling, growth, metastasis, cell division, cell migration, soft agar 
colony formation, contact inhibition, invasiveness, angiogenesis, apoptosis, tumor progression or 
other malignant phenotype. 

A. In Vitro Assays 

10 In one embodiment, the invention is to be applied for the screening of compounds that 

bind to the TS10q23.3 molecule or fragment thereof. The polypeptide or fragment may be either 
free in solution, fixed to a support, expressed in or on the surface of a cell. Either the 
polypeptide or the compound may be labeled, thereby permitting determining of binding. 

In another embodiment, the assay may measure the inhibition of binding of TS10q23.3 to 
15 a natural or artificial substrate or binding partner. Competitive binding assays can be performed 
in which one of the agents (TS10q23.3, binding partner or compound) is labeled. Usually, the 
polypeptide will be the labeled species. One may measure the amount of free label versus bound 
* label to determine binding or inhibition of binding. 

Another technique for high throughput screening of compounds is described in WO 
20 84/03564. Large numbers of small peptide test compounds are synthesized on a solid substrate, 
such as plastic pins or some other surface. The peptide test compounds are reacted with 
TS10q23.3 and washed. Bound polypeptide is detected by various methods. 

Purified TS10q23.3 can be coated directly onto plates for use in the aforementioned drug 
screening techniques. However, non-neutralizing antibodies to the polypeptide can be used to 
25 immobilize the polypeptide to a solid phase. Also, fusion proteins containing a reactive region 
(preferably a terminal region) may be used to link the TS10q23.3 active region to a solid phase. 

Various cell lines containing wild-type or natural or engineered mutations in TS10q23.3 
can be used to study various functional attributes of TS10q23.3 and how a candidate compound 
affects these attributes. Methods for engineering mutations are described elsewhere in this 
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document, as are naturally-occurring mutations in TS10q23.3 that lead to, contribute to and/or 
otherwise cause malignancy. In such assays, the compound would be formulated appropriately, 
given its biochemical nature, and contacted with a target cell. Depending on the assay, culture 
may be required. The cell may then be examined by virtue of a number of different physiologic 
5 assays. Alternatively, molecular analysis may be performed in which the function of TS 1 0q23 .3, 
or related pathways, may be explored. This may involve assays such as those for protein 
expression, enzyme function, substrate utilization, phosphorylation states of various molecules 
including TS10q23.3, cAMP levels, mRNA expression (including differential display of whole 
cell or polyA RNA) and others. 

10 B. In Vivo Assays 

The present invention also encompasses the use of various animal models. Here, the 
identity seen between human and mouse TS10q23.3 provides an excellent opportunity to 
examine the function of TS10q23.3 in a whole animal system where it is normally expressed. By 
developing or isolating mutant cells lines that fail to express normal TS10q23.3, one can 

15 generate cancer models in mice that will be highly predictive of cancers in humans and other 
mammals. These models may employ the orthotopic or systemic administration of tumor cells to 
mimic primary and/or metastatic cancers. Alternatively, one may induce cancers in animals by 
providing agents known to be responsible for certain events associated with malignant 
transformation and/or tumor progression. Finally, transgenic animals (discussed below) that lack 

20 a wild-type TS 1 0q23 .3 may be utilized as models for cancer development and treatment. 

Treatment of animals with test compounds will involve the administration of the 
compound, in an appropriate form, to the animal. Administration will be by any route the could 
be utilized for clinical or non-clinical purposes, including but not limited to oral, nasal, buccal, 
rectal, vaginal or topical. Alternatively, administration may be by intratracheal instillation, 
25 bronchial instillation, intradermal, subcutaneous, intramuscular, intraperitoneal or intravenous 
injection. Specifically contemplated are systemic intravenous injection/regional administration 
via blood or lymph supply and intratumoral injection. 

Determining the effectiveness of a compound in vivo may involve a variety of different 
criteria. Such criteria include, but are not limited to, survival, reduction of tumor burden or 
30 mass, arrest or slowing of tumor progression, elimination of tumors, inhibition or prevention of 
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metastasis, increased activity level, improvement in immune effector function and improved food 
intake. 

C. Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active 
5 polypeptides or compounds with which they interact (agonists, antagonists, inhibitors, binding 
partners, etc.). By creating such analogs, it is possible to fashion drugs which are more active or 
stable than the natural molecules, which have different susceptibility to alteration or which may 
affect the function of various other molecules. In one approach, one would generate a three- 
dimensional structure for TS10q23.3 or a fragment thereof This could be accomplished by x- 
10 ray crystallograph, computer modeling or by a combination of both approaches. An alternative 
approach, "alanine scan/' involves the random replacement of residues throughout molecule with 
alanine, and the resulting affect on function determined. 

It also is possible to isolate a TS10q23.3 specific antibody, selected by a functional assay, 
and then solve its crystal structure. In principle, this approach yields a pharmacore upon which 

15 subsequent drug design can be based. It is possible to bypass protein crystallograph altogether 
by generating anti-idiotypic antibodies to a functional, pharmacologically active antibody. As a 
mirror image of a mirror image, the binding site of anti-idiotype would be expected to be an 
analog of the original antigen. The anti-idiotype could then be used to identify and isolate 
peptides from banks of chemically- or biologically-produced peptides. Selected peptides would 

20 then serve as the pharmacore. Anti-idiotypes may be generated using the methods described 
herein for producing antibodies, using an antibody as the antigen. 

Thus, one may design drugs which have improved TS10q23.3 activity or which act as 
stimulators, inhibitors, agonists, antagonists or TS10q23.3 or molecules affected by TS10q23.3 
function. By virtue of the availability of cloned TS0q23.3 sequences, sufficient amounts of 
25 TS10q23.3 can be produced to perform crystallographic studies. In addition, knowledge of the 
polypeptide sequences permits computer employed predictions of structure-function 
relationships. 
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VI. Methods for Treating 10q23.3 Related Malignancies 

The present invention also involves, in another embodiment, the treatment of cancer. 
The types of cancer that may be treated, according to the present invention, is limited only by the 
involvement of TS10q23.3. By involvement, it is not even a requirement that TS10q23.3 be 
5 mutated or abnormal - the overexpression of this tumor suppressor may actually overcome other 
lesions within the cell. Thus, it is contemplated that a wide variety of tumors may be treated 
using TS10q23.3 therapy, including cancers of the brain (glioblastoma, astrocytoma, 
oligodendroglioma, ependymomas), lung, liver, spleen, kidney, lymph node, pancreas, small 
intestine, blood cells, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, head 
10 and neck, esophagus, bone marrow, blood or other tissue. 

In many contexts, it is not necessary that the tumor cell be killed or induced to undergo 
normal cell death or "apoptosis." Rather, to accomplish a meaningful treatment, all that is 
required is that the tumor growth be slowed to some degree. It may be that the tumor growth is 
completely blocked, however, or that some tumor regression is achieved. Clinical terminology 
15 such as "remission" and "reduction of tumor" burden also are contemplated given their normal 
usage. 

A. Genetic Based Therapies 

One of the therapeutic embodiments contemplated by the present inventors is the 
intervention, at the molecular level, in the events involved in the tumorigenesis of some cancers. 

20 Specifically, the present inventors intend to provide, to a cancer cell, an expression construct 
capable of providing TS10q23.3 to that cell. Because the human, mouse and dog genes all 
encode the same polypeptide, any of these nucleic acids could be used in human therapy, as 
could any of the gene sequence variants discussed above which would encode the same, or a 
biologically equivalent polypeptide. The lengthy discussion of expression vectors and the 

25 genetic elements employed therein is incorporated into this section by reference. Particularly 
preferred expression vectors are viral vectors such as adenovirus, adeno-associated virus, 
herpesvirus, vaccinia virus and retrovirus. Also preferred is liposomally-encapsulated expression 
vector. 
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Those of skill in the art are well aware of how to apply gene delivery to in vivo and ex 
vivo situations. For viral vectors, one generally will prepare a viral vector stock. Depending on 
the kind of virus and the titer attainable, one will deliver 1 X 10 4 , 1 X 10 5 , 1 X 10 6 , 1 X 10 7 , 1 X 
10 8 , 1 X 10 9 , 1 X 10 10 , 1 X 10 u or 1 X 10 12 infectious particles to the patient. Similar figures 
5 may be extrapolated for liposomal or other non-viral formulations by comparing relative uptake 
efficiencies. Formulation as a pharmaceutically acceptable composition is discussed below. 

Various routes are contemplated for various tumor types. The section below on routes 
contains an extensive list of possible routes. For practically any tumor, systemic delivery is 
contemplated. This will prove especially important for attacking microscopic or metastatic 

10 cancer. Where discrete tumor mass may be identified, a variety of direct, local and regional 
approaches may be taken. For example, the tumor may be directly injected with the expression 
vector. A tumor bed may be treated prior to, during or after resection. Following resection, one 
generally will deliver the vector by a catheter left in place following surgery. One may utilize the 
tumor vasculature to introduce the vector into the tumor by injecting a supporting vein or artery. 

15 A more distal blood supply route also may be utilized. 

In a different embodiment, ex vivo gene therapy is contemplated. This approach is 
particularly suited, although not limited, to treatment of bone marrow associated cancers. In an 
ex vivo embodiment, cells from the patient are removed and maintained outside the body for at 
least some period of time. During this period, a therapy is delivered, after which the cells are 
20 reintroduced into the patient; hopefully, any tumor cells in the sample have been killed. 

Autologous bone marrow transplant (ABMT) is an example of ex vivo gene therapy. 
Basically, the notion behind ABMT is that the patient will serve as his or her own bone marrow 
donor. Thus, a normally lethal dose of irradiation or chemotherapeutic may be delivered to the 
patient to kill tumor cells, and the bone marrow repopulated with the patients own cells that 
25 have been maintained (and perhaps expanded) ex vivo. Because, bone marrow often is 
contaminated with tumor cells, it is desirable to purge the bone marrow of these cells. Use of 
gene therapy to accomplish this goal is yet another way TS10q23.3 may be utilized according to 
the present invention. 
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B. Immunotherapies 

Immunotherapeutics, generally, rely on the use of immune effector cells and molecules to 
target and destroy cancer cells. The immune effector may be, for example, an antibody specific 
for some marker on the surface of a tumor cell. The antibody alone may serve as an effector of 
5 therapy or it may recruit other cells to actually effect cell killing. The antibody also may be 
conjugated to a drug or toxin (chemotherapeutic, radionuclide, ricin A chain, cholera toxin, 
pertussis toxin, etc.) and serve merely as a targeting agent. Alternatively, the effector may be a 
lymphocyte carrying a surface molecule that interacts, either directly or indirectly, with a tumor 
cell target. Various effector cells include cytotoxic T cells and NK cells. 

10 According to the present invention, it is unlikely that TS10q23.3 could serve as a target 

for an immune effector given that (i) it is unlikely to be expressed on the surface of the cell and 
(ii) that the presence, not absence, of TS10q23.3 is associated with the normal state. However, 
it is possible that particular mutant forms of TS10q23.3 may be targeted by immunotherapy, 
either using antibodies, antibody conjugates or immune effector cells. 

15 A more likely scenario is that immunotherapy could be used as part of a combined 

therapy, in conjunction with TS10q23.3-targeted gene therapy. The general approach for 
combined therapy is discussed below. Generally, the tumor cell must bear some marker that is 
amenable to targeting, i.e., is not present on the majority of other cells. Many tumor marker 
exist and any of these may be suitable for targeting in the context of the present invention. 

20 Common tumor markers include carcinoembryonic antigen, prostate specific antigen, urinary 
tumor associated antigen, fetal antigen, tyrosinase (p97), gp68, TAG-72, HMFG, Sialyl Lewis 
Antigen, MucA, MucB, PLAP, estrogen receptor, laminin receptor, erb B and pi 55. 

C. Protein Therapy 

Another therapy approach is the provision, to a subject, of TS10q23.3 polypeptide, 
25 active fragments, synthetic peptides, mimetics or other analogs thereof. The protein may be 
produced by recombinant expression means or, if small enough, generated by an automated 
peptide synthesizer. Formulations would be selected based on the route of administration and 
purpose including, but not limited to, liposomal formulations and classic pharmaceutical 
preparations. 



PCT/US98/00353 



WO 98/33907 PCT/US98/00353 

-73- 

D. Combined Therapy with Immunotherapy, Traditional Chemo- or 
Radiotherapy 

Tumor cell resistance to DNA damaging agents represents a major problem in clinical 
oncology. One goal of current cancer research is to find ways to improve the efficacy of chemo- and 

5 radiotherapy. One way is by combining such traditional therapies with gene therapy. For example, 
the herpes simplex-thymidine kinase (HS-/£) gene, when delivered to brain tumors by a retroviral 
vector system, successfully induced susceptibility to the antiviral agent ganciclovir (Culver et al. y 
1992). In the context of the present invention, it is contemplated that TS10q23.3 replacement 
therapy could be used similarly in conjunction with chemo- or radiotherapeutic intervention. It also 

1 0 may prove effective to combine TS 1 0q23 .3 gene therapy with immunotherapy, as described above. 

To kill cells, inhibit cell growth, inhibit metastasis, inhibit angiogenesis or otherwise reverse or 
reduce the malignant phenotype of tumor cells, using the methods and compositions of the present 
invention, one would generally contact a "target" cell with a TS10q23.3 expression construct and at 

15 least one other agent. These compositions would be provided in a combined amount effective to kill 
or inhibit proliferation of the cell. This process may involve contacting the cells with the expression 
construct and the agent(s) or factor(s) at the same time. This may be achieved by contacting the cell 
with a single composition or pharmacological formulation that includes both agents, or by contacting 
the cell with two distinct compositions or formulations, at the same time, wherein one composition 

20 includes the expression construct and the other includes the agent. 

Alternatively, the gene therapy treatment may precede or follow the other agent treatment by 
intervals ranging from minutes to weeks. In embodiments where the other agent and expression 
construct are applied separately to the cell, one would generally ensure that a significant period of 

25 time did not expire between the time of each delivery, such that the agent and expression construct 
would still be able to exert an advantageously combined effect on the cell. In such instances, it is 
contemplated that one would contact the cell with both modalities within about 12-24 hours of each 
other and, more preferably, within about 6-12 hours of each other, with a delay time of only about 12 
hours being most preferred. In some situations, it may be desirable to extend the time period for 

30 treatment significantly, however, where several days (2, 3, 4, 5, 6 or 7) to several weeks (1, 2, 3, 4, 5, 
6, 7 or 8) lapse between the respective administrations. 
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5 



It also is conceivable that more than one administration of either TS10q23.3 or the other 
agent will be desired. Various combinations may be employed, where TS10q23.3 is "A" and the 
other agent is "B", as exemplified below: 



A/B/A B/A/B B/B/A A/A/B B/A/A A/B/B B/B/B/A B/B/A/B 

A/A/B/B A/B/A/B A/B/B/A B/B/A/A B/A/B/A B/A/A/B B/B/B/A 

10 A/A/A/B B/A/A/A A/B/A/A A/Am/A A/B/B/B B/A/B/B B/B/A/B 

Other combinations are contemplated. Again, to achieve cell killing, both agents are delivered to a 
cell in a combined amount effective to kill the cell. 

15 Agents or factors suitable for use in a combined therapy are any chemical compound or 

treatment method that induces DNA damage when applied to a cell. Such agents and factors include 
radiation and waves that induce DNA damage such as, y-irradiation, X-rays, UV-irradiation, 
microwaves, electronic emissions, and the like. A variety of chemical compounds, also described as 
"chemotherapeutic agents," function to induce DNA damage, all of which are intended to be of use in 

20 the combined treatment methods disclosed herein. Chemotherapeutic agents contemplated to be of 
use, include, e.g., adriamycin, 5-fluorouracil (5FU), etoposide (VP-16), camptothecin, 
actinomycin-D, mitomycin C, cisplatin (CDDP) and even hydrogen peroxide. The invention also 
encompasses the use of a combination of one or more DNA damaging agents, whether radiation- 
based or actual compounds, such as the use of X-rays with cisplatin or the use of cisplatin with 

25 etoposide. In certain embodiments, the use of cisplatin in combination with a TS10q23.3 expression 
construct is particularly preferred as this compound. 

In treating cancer according to the invention, one would contact the tumor cells with an agent 
in addition to the expression construct. This may be achieved by irradiating the localized tumor site 
30 with radiation such as X-rays, UV-light, y-rays or even microwaves. Alternatively, the tumor cells 
may be contacted with the agent by aaministering to the subject a therapeutically effective amount of 
a pharmaceutical composition comprising a compound such as, adriamycin, 5-fluorouracil, etoposide, 
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camptothecin, actinomycin-D, mitomycin C, or more preferably, cisplatin. The agent may be 
prepared and used as a combined therapeutic composition, or kit, by combining it with a TS10q23.3 
expression construct, as described above. 

Agents that directly cross-link nucleic acids, specifically DNA, are envisaged to facilitate 
DNA damage leading to a synergistic, antineoplastic combination with TS10q23.3. Agents such as 
cisplatin, and other DNA alkylating agents may be used. Cisplatin has been widely used to treat 
cancer, with efficacious doses used in clinical applications of 20 mg/m 2 for 5 days every three weeks 
for a total of three courses. Cisplatin is not absorbed orally and must therefore be delivered via 
injection intravenously, subcutaneously, intratumorally or intraperitoneally. 

Agents that damage DNA also include compounds that interfere with DNA replication, 
mitosis and chromosomal segregation. Such chemotherapeutic compounds include adriamycin, also 
known as doxorubicin, etoposide, verapamil, podophyllotoxin, and the like. Widely used in a clinical 
setting for the treatment of neoplasms, these compounds are administered through bolus injections 
intravenously at doses ranging from 25-75 mg/m 2 at 21 day intervals for adriamycin, to 35-50 mg/m 2 
for etoposide intravenously or double the intravenous dose orally. 

Agents that disrupt the synthesis and fidelity of nucleic acid precursors and subunits also lead 
to DNA damage. As such a number of nucleic acid precursors have been developed. Particularly 
useful are agents that have undergone extensive testing and are readily available. As such, agents 
such as 5-fluorouracil (5-FU), are preferentially used by neoplastic tissue, making this agent 
particularly useful for targeting to neoplastic cells. Although quite toxic, 5-FU, is applicable in a wide 
range of carriers, including topical, however intravenous administration with doses ranging from 3 to 
15 mg/kg/day being commonly used. 

Other factors that cause DNA damage and have been used extensively include what are 
commonly known as y-rays, X-rays, and/or the directed delivery of radioisotopes to tumor cells. 
Other forms of DNA damaging factors are also contemplated such as microwaves and UV- 
irradiation. It is most likely that all of these factors effect a broad range of damage DNA, on the 
precursors of DNA, the replication and repair of DNA, and the assembly and maintenance of 
chromosomes. Dosage ranges for X-rays range from daily doses of 50 to 200 roentgens for 
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prolonged periods of time (3 to 4 weeks), to single doses of 2000 to 6000 roentgens. Dosage ranges 
for radioisotopes vary widely, and depend on the half-life of the isotope, the strength and type of 
radiation emitted, and the uptake by the neoplastic cells. 

The skilled artisan is directed to "Remington's Pharmaceutical Sciences" 15th Edition, chapter 
33, in particular pages 624-652. Some variation in dosage will necessarily occur depending on the 
condition of the subject being treated. The person responsible for administration will, in any event, 
determine the appropriate dose for the individual subject. Moreover, for human administration, 
preparations should meet sterility, pyrogenicity, general safety and purity standards as required by 
FDA Office of Biologies standards. 

The inventors propose that the regional delivery of TS10q23.3 expression constructs to 
patients with 10q23.3-linked cancers will be a very efficient method for delivering a therapeutically 
effective gene to counteract the clinical disease. Similarly, the chemo- or radiotherapy may be 
directed to a particular, affected region of the subjects body. Alternatively, systemic delivery of 
expression construct and/or the agent may be appropriate in certain circumstances, for example, 
where extensive metastasis has occurred. 

In addition to combining TS10q23.3-targeted therapies with chemo- and radiotherapies; it 
also is contemplated that combination with other gene therapies will be advantageous. For example, 
targeting of TS10q23.3 and p53 or pl6 mutations at the same time may produce an improved anti- 
cancer treatment. Any other tumor-related gene conceivably can be targeted in this manner, for 
example, p21, Rb, APC, DCC, NF-1, NF-2, BCRA2, pl6, FHIT, WT-1, MEN-I, MEN-II, BRCA1, 
VHL, FCC, MCC, ras, myc, neu, rqf, erb, src.finsjun, trk, ret, gsp, hst, bclaoAabl. 

It also should be pointed out that any of the foregoing therapies may prove useful by 
themselves in treating a TS10q23.3. In this regard, reference to chemotherapeutics and non- 
TS 10q23.3 gene therapy in combination should also be read as a contemplation that these approaches 
may be employed separately. 
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E. Formulations and Routes for Administration to Patients 

Where clinical applications are contemplated, it will be necessary to prepare 
pharmaceutical compositions - expression vectors, virus stocks, proteins, antibodies and drugs - 
in a form appropriate for the intended application. Generally, this will entail preparing 
compositions that are essentially free of pyrogens, as well as other impurities that could be 
harmful to humans or animals. 

One will generally desire to employ appropriate salts and buffers to render delivery 
vectors stable and allow for uptake by target cells. Buffers also will be employed when 
recombinant cells are introduced into a patient. Aqueous compositions of the present invention 
comprise an effective amount of the vector to cells, dissolved or dispersed in a pharmaceutically 
acceptable carrier or aqueous medium. Such compositions also are referred to as inocula. The 
phrase "pharmaceutically or pharmacologically acceptable" refer to molecular entities and 
compositions that do not produce adverse, allergic, or other untoward reactions when 
administered to an animal or a human. As used herein, "pharmaceutically acceptable carrier" 
includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents and the like. The use of such media and agents for 
pharmaceutically active substances is well know in the art. Except insofar as any conventional 
media or agent is incompatible with the vectors or cells of the present invention, its use in 
therapeutic compositions is contemplated. Supplementary active ingredients also can be 
incorporated into the compositions. 

The active compositions of the present invention may include classic pharmaceutical 
preparations. Administration of these compositions according to the present invention will be 
via any common route so long as the target tissue is available via that route. This includes oral, 
nasal, buccal, rectal, vaginal or topical. Alternatively, administration may be by orthotopic, 
intradermal, subcutaneous, intramuscular, intraperitoneal or intravenous injection. Such 
compositions would normally be administered as pharmaceutically acceptable compositions, 
described supra. 

The active compounds may also be administered parenterally or intraperitoneally. 
Solutions of the active compounds as free base or pharmacologically acceptable salts can be 
prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions 
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can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations contain a preservative to 
prevent the growth of microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or 
5 dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions 
or dispersions. In all cases the form must be sterile and must be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage and must 
be preserved against the contaminating action of microorganisms, such as bacteria and fungi. 
The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 

10 polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), 
suitable mixtures thereof and vegetable oils. The proper fluidity can be maintained, for example, 
by the use of a coating, such as lecithin, by the maintenance of the required particle size in the 
case of dispersion and by the use of surfactants. The prevention of the action of microorganisms 
can be brought about by various antibacterial an antifungal agents, for example, parabens, 

15 chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable 
to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the 
injectable compositions can be brought about by the use in the compositions of agents delaying 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the 
20 required amount in the appropriate solvent with various of the other ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared by 
incorporating the various sterilized active ingredients into a sterile vehicle which contains the 
basic dispersion medium and the required other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile injectable solutions, the preferred methods 
25 of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the 
active ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

As used herein, "pharmaceutically acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying 
30 agents and the like. The use of such media and agents for pharmaceutical active substances is 
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well known in the art. Except insofar as any conventional media or agent is incompatible with 
the active ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. 

For oral administration the polypeptides of the present invention may be incorporated 
with excipients and used in the form of non-ingestible mouthwashes and dentifrices. A 
mouthwash may be prepared incorporating the active ingredient in the required amount in an 
appropriate solvent, such as a sodium borate solution (Dobell's Solution). Alternatively, the 
active ingredient may be incorporated into an antiseptic wash containing sodium borate, glycerin 
and potassium bicarbonate. The active ingredient may also be dispersed in dentifrices, including: 
gels, pastes, powders and slurries. The active ingredient may be added in a therapeutically 
effective amount to a paste dentifrice that may include water, binders, abrasives, flavoring 
agents, foaming agents, and humectants. 

The compositions of the present invention may be formulated in a neutral or salt form. 
Phannaceutically-acceptable salts include the acid addition salts (formed with the free amino 
groups of the protein) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and 
the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases 
such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. 

Upon formulation, solutions will be administered in a manner compatible with the dosage 
formulation and in such amount as is therapeutically effective. The formulations are easily 
administered in a variety of dosage forms such as injectable solutions, drug release capsules and 
the like. For parenteral administration in an aqueous solution, for example, the solution should 
be suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient 
saline or glucose. These particular aqueous solutions are especially suitable for intravenous, 
intramuscular, subcutaneous and intraperitoneal administration. In this connection, sterile 
aqueous media which can be employed will be known to those of skill in the art in light of the 
present disclosure. For example, one dosage could be dissolved in 1 ml of isotonic NaCl 
solution and either added to 1000 ml of hypodermoclysis fluid or injected at the proposed site of 
infusion, (see for example, "Remington's Pharmaceutical Sciences" 15th Edition, pages 1035- 
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1038 and 1570-1580), Some variation in dosage will necessarily occur depending on the 
condition of the subject being treated. The person responsible for administration will, in any 
event, determine the appropriate dose for the individual subject. Moreover, for human 
administration, preparations should meet sterility, pyrogenicity, general safety and purity 
5 standards as required by FDA Office of Biologies standards. 

VTL Transgenic Animals/Knockout Animals 

In one embodiment of the invention, transgenic animals are produced which contain a 
functional transgene encoding a functional TS10q23.3 polypeptide or variants thereof. 
10 Transgenic animals expressing TS10q23.3 transgenes, recombinant cell lines derived from such 
animals and transgenic embryos may be useful in methods for screening for and identifying 
agents that induce or repress function of TS10q23.3. Transgenic animals of the present invention 
also can be used as models for studying indications such as cancers. 

In one embodiment of the invention, a TS10q23.3 transgene is introduced into a non- 
15 human host to produce a transgenic animal expressing a human or murine TS10q23.3 gene. The 
transgenic animal is produced by the integration of the transgene into the genome in a manner 
that permits the expression of the transgene. Methods for producing transgenic animals are 
generally described by Wagner and Hoppe (U.S. Patent No. 4,873,191; which is incorporated 
herein by reference), Brinster et al 1985; which is incorporated herein by reference in its 
20 entirety) and in "Manipulating the Mouse Embryo; A Laboratory Manual" 2nd edition (eds., 
Hogan, Beddington, Costantimi and Long, Cold Spring Harbor Laboratory Press, 1994; which is 
incorporated herein by reference in its entirety). 

It may be desirable to replace the endogenous TS10q23.3 by homologous recombination 
between the transgene and the endogenous gene; or the endogenous gene may be eliminated by 
25 deletion as in the preparation of "knock-out" animals. Typically, a TS10q23.3 gene flanked by 
genomic sequences is transferred by microinjection into a fertilized egg. The microinjected eggs 
are implanted into a host female, and the progeny are screened for the expression of the 
transgene. Transgenic animals may be produced from the fertilized eggs from a number of 
animals including, but not limited to reptiles, amphibians, birds, mammals, and fish. Within a 
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particularly preferred embodiment, transgenic mice are generated which overexpress TS10q23.3 
or express a mutant form of the polypeptide. Alternatively, the absence of a TS10q23.3 in 
"knock-out" mice permits the study of the effects that loss of TS10q23.3 protein has on a cell in 
vivo. Knock-out mice also provide a model for the development of TS10q23.3-related cancers. 

5 As noted above, transgenic animals and cell lines derived from such animals may find use 

in certain testing experiments. In this regard, transgenic animals and cell lines capable of 
expressing wild-type or mutant TS10q23.3 may be exposed to test substances. These test 
substances can be screened for the ability to enhance wild-type TS10q23.3 expression and or 
function or impair the expression or function of mutant TS10q23.3. 

10 

Vm. Examples 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skilled the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 

15 practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should , in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain a 
like or similar result without departing from the concept, spirit and scope of the invention. More 
specifically, it will be apparent that certain agents which are both chemically and physiologically 

20 related may be substituted for the agents described herein while the same or similar results would 
be achieved. All such similar substitutes and modifications apparent to those skilled in the art 
are deemed to be within the spirit, scope and concept of the invention as defined by the 
appended claims. 

Example J - Homozygous Deletions in Glioma Cell Lines 

25 The inventors have examined DNA from a series of 21 glioma cell lines and primary 

cultures, along with normal cells, to identify homozygous deletions of genomic material on 
chromosome 10. Markers were chosen for their approximate location at or near previously 
implicated regions (FIG. 1). The cells analyzed were generated in the Department of Neuro- 
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Oncology UTMDACC (LGll, EFC-2, PL-1, PC-1, JW, FG-2, FG-0, NG-1, PH-2, KE, PC-3, 
and D77), were commercially available (U138, A172, U373, U87, U251, U118, and T98G), or 
obtained from collaborators (13 wk astro, D54-MG). Markers were obtained from Research 
Genetics, Huntsville, AL, or synthesized from reported sequence. Once cell line, EFC-2, 
5 revealed a large homozygous deletion associated with four markers surrounding D10S215 
(FIG. 2). This deletion was also observed by FISH using YAC 746h6, which maps to the 
region. Three other cell lines (D-54, A172, and LG1 1) also demonstrated homozygous deletions 
at AFMA086WG9 (AFM086), thereby strongly implicating the region to contain a putative 
tumor suppressor gene (FIG. 2). Deletions in PCR™ reactions were performed in the presence 
10 of two primer pairs (multiplexed) to assure appropriate amplification conditions. All deletions 
were confirmed by (at least) triplicate reactions. This same region has also been implicated in 
prostate carcinoma (Gray et al, 1995). Homozygous deletions in cell lines also have been used 
to define a tumor suppressor gene locus at 3p21.3 in small cell lung carcinoma (Daly et al, 
1993;Koke/a/., 1994; Wei et al, 1996). 

15 Example 2 - Retention of lOqLoci in Suppressed Hybrid Cells 

The inventors' second strategy was to examine the regions of chromosome 10 that were 
retained in suppressed hybrid clones, but absent in the revertant clones. This analysis extended 
the inventors' previous study, showing the presence of two tumor suppressor loci on 
chromosome 10 and analyzing the regions that were retained. Hybrids retaining all or portions 

20 of lOq failed to grow in soft agarose and in nude mice ("fully" suppressed clones), while hybrid 
cells that lost the majority of the inserted chromosome lOq grew in soft agarose, but were 
nontumorigenic ("partially" suppressed clones; Steck et al t 1995; FIG. 3, right side). Original 
clones U251N10.6, N10.7, and N10.8 previously were shown to retain only fragments of lOq 
(Pershouse et al., 1993; Steck et al, 1995). Using additional informative microsatellite markers, 

25 three retained regions were identified in all three suppressed clones; a 22 cM region from 
D10S219 to D10S110, a 14 cM region from D10S192 to D10S187, and a 18 cM region from 
D10S169 through D10S1 134 (FIG. 3). 

To bypass this limitation, the originally transferred neomycin resistance-tagged 
chromosome 10 from hybrid U251.N10.7 was "rescued" by microcell-mediated chromosome 
30 transfer into mouse A9 cells. This allows all human microsatellite markers to be informative for 
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the presence of chromosome 10. The basis for this analysis is that all "fully" suppressed 
subclones should retain a common region and this region is deleted in the "partially" suppressed 
subclones. An additional impetus was that N10.7 displayed considerable heterogeneity in the 
size of chromosome 10 retained, as determined by FISH using chromosome 10 specific probes. 

5 Also, hybrid cells used for this rescue were first assayed for soft agarose growth and showed no 
colony formation. The mouse hybrids containing the transferred human chromosome 10 all 
contained the short arm of chromosome 10. The same region was retained in the "partially" 
suppressed clones (N10.5a-j) that grew in soft agarose (Steck et al t 1995), thus excluding this 
region (lOpter-lOql 1) as containing the lOq tumor suppressor gene. Examination of the 

10 retained regions of lOq illustrated considerable heterogeneity (FIG. 3). The majority of clones 
showed either partial or extensive deletions of 10q23-26. Only two regions were retained in all 
the subclones examined. The most centromeric region retained involved the markers D10S210 
and D10S219. However, these markers were absent in the original N10.6 and/or N10.8 clones, 
excluding this region (FIG. 3). The other region was centromeric of D4S536 but telomeric of 

15 D10S215 (-4 cM). The markers AFM086 and D10S536 were retained in all clones examined 
(boxed region in FIG. 3). These markers were absent in the partially suppressed clones 
(N10.5a-j). These results demonstrate that a common region, surrounding AFM086, is retained 
in all hybrid cells that are phenotypically suppressed. This same region is deleted in several 
glioma cell lines. 

20 This analysis has several limitations. First, the rescued clones cannot be analyzed for 

biological activity, therefore any changes in chromosome 10 which may have occurred during or 
after transfer could not be detected. To partially address this concern, the inventors' analysis 
was performed as soon as the clones were able to be harvested. Furthermore, retention of this 
portion of the chromosome may only "correct" an in vitro artifactual deletion. Consequently, 

25 allelic deletion studies were performed to determine if this region was involved in gliomas. Also, 
an alternative region was suggested by this analysis atD10S1158, where all the clones but one 
(C7) retained this region. However, the retained region at AFM086 also exhibited homozygous 
deletions, thereby being implicated by two alternative methods as compared to D10S1 158. It is 
also interesting to note that the tumor suppressor gene region appears to be preferentially 

30 retained , while the remainder of lOq is fragmented. 
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Example 3 - Allelic Deletion Analysis of lOq 

An allelic deletion study was performed on DNA from a series of 53 glioma specimens 
and corresponding patient lymphocytes using microsatellite markers specific for chromosome 10. 
This study was undertaken to determine if our critical region also was involved in glioma 
5 specimens. Extensive deletions were observed in the majority of specimens derived from GBM, 
with 30 of 38 GBMs exhibiting deletion of most or all of chromosome 10 markers. Less 
extensive deletions were observed in the majority if specimens derived from anaplastic 
astrocytomas, while infrequent deletions were observed in astrocytomas and most 
oligodendrogliomas (FIG. 4 and data not shown). The majority of markers used in this analysis 
10 mapped to 10q23-26 (Gyapay et al, 1994). Similar to other studies, a common region of 
deletion could not be convincingly demonstrated, due to the large deletions in most GBM 
samples (Fults et al., 1993; Rasheed et al, 1995). 

However, for the GBM specimens examined, all but one tumor sample (#9; FIG. 4) 
revealed deletions involving the region from D10S579 to D10S541. Furthermore, only one AA 

15 showed a deletion at the inventors' critical region, and no astrocytomas. Two 
oligodendrogliomas exhibited deletions within the critical region, but both were diagnosed as 
malignant. This study presents several possibilities. First, the deletions involving the inventors' 
critical region occur predominantly in GBMs and not in lower grade tumors. This would imply 
that loss of the tumor suppressor gene on chromosome lOq in the inventors' critical region 

20 would represent a genetic alteration associated with progression to GBM. In support of this 
hypothesis, even though deletions occur on lOq in lower grade tumors, no common region of 
deletion on lOq was identified for these specimens. This observations would, again, support the 
inventors' previous suggestion that deletion of the lOq tumor suppressor gene is predominantly 
associated with GBMs and not all deletions on lOq affect the tumor suppressor gene. The 

25 region D10S216 to D10S587, suggested by Rasheed, showed extensive deletions, but several 
GBMs exhibited retention of heterozygosity at this region (tumors #2, #9, #13, #26; FIG. 4). 
Also, if low grade tumors are excluded from their study, the inventors' region is implicated in all 
GBMs. This combination of independent approaches strongly suggests a lOq tumor suppressor 
gene maps to the region D10S215 to D10S541, specifically at AFM086. 
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Example 4 - Mapping of Candidate Tumor Suppressor Gene Region 

The critical region the inventors have identified is centered at AFM086 and is bordered 
by D10S215 and S10S541 (FIGS. 2 and 8). This region is relatively small, being contained 
within several individual YACs (787d7; 746h8; 934d3). FISH painting with YAC 746h8 on 
EFC-2 metaphase spreads shows that the homozygous deletion is contained within the YAC as 
the YAC was partially observed and adjacent YACs on both sides were present. Bacterial 
artificial chromosomes (BACs) or PACs for all markers in the region have been isolated 
(FIG. 8). The BAC contig of the region was constructed from end sequences of BACs mapping 
to the region. Several notable features have been identified. First, two overlapping BACs were 
identified (46b 12 and 2£20) and verify the genomic integrity of 106dl6. Second, a Not I site was 
identified at one end of the BACs. The presence of the Not I site and coincident restriction 
digestion with Sacl\ Eagl, and jBssHII suggest the presence of a CpG island within 106dl6. 

The EcoRl fragments from BAC 106dl6 were used to examine the extent of the 
homozygous deletions, by Southern blotting, in the glioma cells that were previously shown to 
have homozygously deleted AFM086 (FIGS. 2 and 5). The right side (Eco'Rl fragment 14) 
contains the probable CpG island and is present in three of the four cell lines. A NotVEcdKl 
(#3) fragment was used as a probe on a Southern blot containing several BACs and the glioma 
cell line (FIG. 2). Deletions to the telomeric side (right side) have not been detected using 
probes from 46b 12, except for EFC-2 cells. However, additional homozygous deletions have 
been observed in the cells within the region defined by 106dl6 (-65 kb). A homozygous 
deletion for band 3 is observed for LG1 1 and EFC-2 cells, but not the additional glioma cells or 
normal controls. 106dl6 (band 12) has been observed to be present in all cells (EFC-2 exhibits 
an altered migrating band), suggesting the homozygous deletion is contained entirely within 
106dl6. 

Example 5 - Identification of Expressed Genes within the Critical Region 

EcoRl fragments from BAC 106dl6 were generated and size separated by agarose gel 
electrophoresis. Individual bands or pools of similar sized bands were ligated into pSPL3 
(GIBCO, Gaithersburg, MD). Putative exons were identified as described by the manufacturer. 
Two exons were properly spliced into the trapping vector. The exons were derived from band 
pool 2, 3, 4, 5 and band 7. The sequence of the trapped exons was determined and defined by 
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the known trapping vector sequence. Using BLAST searches of expressed sequence tag 
(dbEST) database, five potential expressed sequence tags (ESTs) were identified. Two ESTs 
(gb/H92038, AA009519) were observed to contain either one or both of the exons (albeit one 
EST was in the wrong orientation). 

5 Sequencing primers were generated from the ESTs and used to define putative exon- 

intron boundaries using BAC46M2 as a template. Nine exons were identified. Sequence 
differences between the ESTs and the genomic template were corrected. All the exons were 
contained within BAC 46b 12. Primers were generated from the intron sequences adjacent to the 
exons to form amplicon units for each exon. Two of the exons were corresponded to the 

10 trapped exons from the BAC 106dl6 EcoKL sequences. The sequence of the gene is shown in 
FIG. 6. The predicted amino acid reading was defined by the presence of an ATG start site, 
TGA and TAA stop codons in frame, the presence of multiple stop codons in all three reading 
frames elsewhere in the sequence, nine splicing sites, and the presence of Kozak signals near the 
initiation site. The 403 amino acid sequence is shown in FIG. 7 and FIG. 9. The predicted 

15 molecular weight is 47, 122 with a pi of 5.86, 

A possible functional role for the protein product is suggested by its sequence homology 
to several protein motifs. A critical motif from residues 88 to 98 [IHCKAGKGRTG] (SEQ ID 
NO: 17) has an exact match for the conserved catalytic domain of a protein tyrosine phosphatase 
[(IAOHCxAGxxR(S/T)G] (SEQ ID NO:18) (Denu et aL, 1996). Several other motifs were 
20 identified that would agree with the phosphatase function for the tumor suppressor gene. 

Amplicons (PCR™ products generated from various regions of the gene) were generated 
from random primed cDNA. The amplicons sequence corresponded to the DNA sequence. 
Non-overlapping amplicons were used to probe Northern blots of normal tissue derived from 
various organs (Clontech, Palo Alto, CA; multitissue blots). All amplicons identified a major 
25 band at 5.5 to 6 kb on the Northern blots and several minor bands. The message was expressed 
in all tissues examined (heart, brain, placenta, lung, liver skeletal muscle, kidney, pancreas, 
spleen, thymus, prostate, testes, ovary, small intestine, colon and peripheral blood lymphocytes). 
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Example 6 - Mutational Analysis 

The mutational analyses have initially proceeded on two fronts. First, the glioma cell 
lines initially shown to have homozygous deletions were analyzed for the presence of the 
candidate gene. As shown in FIG, 8, all of the cell lines that exhibited deletion of AFM086 had 
homozygous deletions of multiple exons of the candidate gene. Furthermore, the deletions 
occurred in the middle of the gene, thus defining the deletion boundaries (similar deletions in all 
cell lines) between exons B and G. Deletions that affect the middle of the gene further indicate 
that the identified gene represents the gene targeted for mutation. 

Preliminary analysis for sequence mutations was also performed on a series of glioma cell 
lines. Mutations and/or deletions were observed in all but three glioma cell lines examined 
(Table 4). Reference to base number in the table references the exon, not the entire sequences, 
/. e. , the 98th base of exon G for U25 1 . 

TABLE 4 



IDENTIFIED MUTATIONS IN CANDIDATE GENE 



Cells 


Cell Type 


Mutation 


Predicted Effect 


1 


U87 


glioma 


splice junction exon c: G+1>T 


splicing variant 


2 


U138 


glioma 


splicing site exon h; G+1>T 


splicing variant 


3 


U251 


glioma 


2 bp addition exon G; 98 ins TT 




4 


U373 


glioma 


frame shift exon G 




5 


EFC-2 


glioma 


-all exons 


no product 


6 


D54 


glioma 


-exons C-I 


no product 


7 


A172 


glioma 


-exons C-I 


no product 


8 


LG11 


glioma 


-exons B-I 


no product 


9 


T98G 


glioma 


missense exon B; T46->G 


leu>arg 


10 


KE 


glioma 


missense exon B; G28->A 


gly>Rlu 


11 


F60 


glioma 


terminal mutation exon H; 
C202->T 


terminal stop 


12 


D77 


glioma 


no mutation (heterogeneous for 
lOq 




13 


PC-3 


low grade 


no mutation 




14 


PH-2 


low grade 


no mutation 




15 


nLnCap 


prostate 


deletion exon A, 16-17 del AA; 
mutation B, C53->T 


silent 
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Also, deletions of exons were found in LnCap, a prostate cell line. The glioma cells that 
failed to show a mutation/deletion were derived from low grade tumors (PC-3 and PH-2) where 
no allelic deletion of chromosome 10 is expected and has been observed for these cells. The 
other cells (D77) were a primary cell culture, and chromosome 10 was shown to be 
heterozygous from a 1 bp polymorphism within the gene. A breast cancer cell line also showed a 
mutation. This initial analysis supports the inventors' conclusion that loss of a lOq tumor 
suppressor gene represents a critical molecular marker for glioblastoma and disease progression. 
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(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: Board of Regents, The University of Texas 

System 

(B) STREET: 201 West 7th Street 

(C) CITY: Austin 

(D) STATE: Texas 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 78701 

(A) NAME: Myriad Genetics, Inc. 

(B) STREET: 320 Wakara Way 

(C) CITY: Sal Lake City 

(D) STATE: Utah 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 84108 

(A) NAME: Peter Steck 

(B) STREET: 5421 Holly 

(C) CITY: Bellaire 

(D) STATE: Texas 

(E) COUNTRY: USA 
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(A) NAME: Mark A. Pershouse 

(B) STREET: 4065 McDermed 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 77025 

(A) NAME: Samar A. Jasser 

(B) STREET: 9411 Kristin 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 77031 

(A) NAME: W.K. Alfred Yung 

(B) STREET: 4141 Byron 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 77005 

(A) NAME: Sean V. Tantigian 

(B) STREET: 557 East 1st Avenue #3 

(C) CITY: Salt Lake City 

(D) STATE: Utah 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 84103 

(ii) TITLE OF INVENTION: A TUMOR SUPPRESSOR DESIGNATED TS10Q23.3 
(iii) NUMBER OF SEQUENCES: 21 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
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(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: Unknown 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/791,115 

(B) FILING DATE: 30-JAN-1997 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Met Thr Ala He lie Lys Glu He Val Ser Arg Asn Lys Arg Arg Tyr 
5 ' 10 



15 



Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr He Tyr Pro Asn He 
20 25 30 

He Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly Val Tyr Arg Asn 
. 40 45 

Asn lie Asp Asp Val Val Arg Phe Leu Asp Ser Lys His Lys Asn His 
3U 55 60 

Tyr Lys He Tyr Asn Leu Cys Ala Glu Arg His Tyr Asp Thr Ala Lys 
b5 7 ° 75 8 o 

Phe Asn Cys Arg Val Ala Gin Tyr Pro Phe Glu Asp His Asn Pro Pro 
85 90 95 

Gin Leu Glu Leu He Lys Pro Phe Cys Glu Asp Leu Asp Gin Trp Leu 
100 105 110 

Ser Glu Asp Asp Asn His Val Ala Ala He His Cys Lys Ala Gly Lys 

120 125 

Gly Arg Thr Gly Val Met He Cys Ala Tyr Leu Leu His Arg Gly Lys 

135 

Phe Leu Lys Ala Gin Glu Ala Leu Asp Phe Tyr Gly Glu Val Arg Thr 

143 150 155 160 

Arg Asp Lys Lys Gly Val Thr He Pro Ser Gin Arg Arg Tyr Val Tyr 
165 170 175 

Tyr Tyr Ser Tyr Leu Leu Lys Asn His Leu Asp Tyr Arg Pro Val Ala 
180 185 190 

Leu Leu Phe His Lys Met Met Phe Glu Thr He Pro Met Phe Ser Gly 

l^j onn ■* 



205 



Gly Thr Cys Asn Pro Gin Phe Val Val Cys Gin Leu Lys Val Lys He 

215 220 

Tyr Ser Ser Asn Ser Gly Pro Thr Arg Arg Glu Asp Lys Phe Met Tyr 
^ 230 235 240 

Phe Glu Phe Pro Gin Pro Leu Pro Val Cys Gly Asp He Lys Val Glu 
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245 250 255 

Phe Phe His Lys Gin Asn Lys Met Leu Lys Lys Asp Lys Met Phe His 
260 265 270 

Phe Trp Val Asn Thr Phe Phe lie Pro Gly Pro Glu Glu Thr Ser Glu 
275 280 285 

Lys Val Glu Asn Gly Ser Leu Cys Asp Gin Glu lie Asp Ser lie Cys 
290 295 300 

Ser lie Glu Arg Ala Asp Asn Asp Lys Glu Tyr Leu Val Leu Thr Leu 
305 310 315 320 

Thr Lys Asn Asp Leu Asp Lys Ala Asn Lys Asp Lys Ala Asn Arg Tyr 
325 330 335 

Phe Ser Pro Asn Phe Lys Val Lys Leu Tyr Phe Thr Lys Thr Val Glu 
340 345 350 

Glu Pro Ser Asn Pro Glu Ala Ser Ser Ser Thr Ser Val Thr Pro Asp 
355 360 365 

Val Ser Asp Asn Glu Pro Asp His Tyr Arg Tyr Ser Asp Thr Thr Asp 
370 375 380 

Ser Asp Pro Glu Asn Glu Pro Phe Asp Glu Asp Gin His Thr Gin lie 
385 390 395 400 

Thr Lys Val 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCTCCCCTCG CCCGGCGCGG TCCCGTCCGC CTCTCGCTCG CCTCCCGCCT CCCCTCGGTC 60 

TTCCGAGGCG CCCGGGCTCC CGGCGCGGCG GCGGAGGGGG CGGGCAGGCC GGCGGGCGGT 120 

GATGTGGCAG GACTCTTTAT GCGCTGCGGC AGGATACGCG CTCGGCGCTG GGACGCGACT 180 

GCGCTCAGTT CTCTCCTCTC GGAAGCTGCA GCCATGATGG AAGTTT GAGA GTTGAGCCGC 24 0 

TGTGAGGCGA GGCCGGGCTC AGGCGAGGGA GAT GAGAG AC GGCGGCGGCC GCGGCCCGGA 300 

GCCCCTCTCA GCGCCTGTGA GCAGCCGCGG GGGCAGCGCC CTCGGGGAGC CGGCCGGCCT 360 

GCGGCGGCGG CAGCGGCGGC GTTTCTCGCC TCCTCTTCGT CTTTTCTAAC CGTGCAGCCT 420 

CTTCCTCGGC TTCTCCTGAA AGGGAAGGTG GAAGCCGTGG GCTCGGGCGG GAGCCGGCTG 480 

AGGCGCGGCG GCGGCGGCGG CGGCACCTCC CGCTCCTGGA GCGGGGGGGA GAAGCGGCGG 540 

CGGCGGCGGC CGCGGCGGCT GCAGCTCCAG GGAGGGGGTC TGAGTCGCCT GTCACCATTT 600 

CCAGGGCTGG GAACGCCGGA GAGTTGGTCT CTCCCCTTCT ACTGCCTCCA ACACGGCGGC 660 
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GGCGGCGGCG 


GCACAT C CAG 


GGACCCGGGC 


CACCCCCCGT 


GGCCCGGGCT 


CCGGAGGCCG 


CGTCTTCTCC 


CCATTCCGCT 


GCCGCCGCTG 


CCCAGTCGCT 


GCAACCATCC 


AGCAGCCGCC 


GCCAAGCGGC 


G G CAG AG C GA 


GGGGCATCAG 


TGCAGAAGAA 


GCCCCGCCAC 


CAG CAG CTT C 


ACAGGCTCCC 


AGACATGACA 


GCCATCATCA 


AT CAAGAGGA 


TGGATTCGAC 


TTAGACTTGA 


GATTTCCTGC 


AGAAAGACTT 


GAAGGCGTAT 


TTTTGGATTC 


AAAGCATAAA 


AACCATTACA 


ATGACACCGC 


CAAATTTAAT 


TGCAGAGTTG 


CACAGCTAGA 


ACTTAT CAAA 


CCCTTTTGTG 


ACAAT CAT GT 


TGCAGCAATT 


CACT GTAAAG 


GTGCATATTT 


ATTACAT CGG 


GGCAAATTTT 


GGGAAGTAAG 


GAC CAGAGAC 


AAAAAGGGAG 


ATTATTATAG 


CTACCTGTTA 


AAGAAT CATC 


ACAAGAT GAT 


GTTTGAAACT 


ATTCCAATGT 


TGGTCTGCCA 


GCTAAAGGTG 


AAGATATATT 


ACAAGTT CAT 


GTACTTTGAG 


TTCCCTCAGC 


AGTTCTTCCA 


CAAACAGAAC 


AAGAT GCTAA 


ATACATT CTT 


CAT AC CAG G A 


CCAGAGGAAA 


GT GAT CAAGA 


AATCGATAGC 


ATTTGCAGTA 


TAGTACTTAC 


TTTAACAAAA 


AATGATCTTG 


ACTTTTCTCC 


AAATTTTAAG 


GTGAAGCTGT 


ATCCAGAGGC 


TAGCAGTTCA 


ACTTCTGTAA 


ATTATAGATA 


TTCTGACACC 


ACTGACTCTG 


AGCATACACA 


AATTACAAAA 


GTCTGAATTT 


AAATAAACTT 


GAATAAACTG 


AAAAT GGACC 


GTCAGATTAC 


CAGTTATAGG 


AACAATTCTC 


CATCCACAGG 


GTTTT GACAC 


TTGTTGTCCA 


TAT AT AC CTT 


TTTGTGTCAA 


AAGGACATTT 


CTTTCCCGTT 


TTATTCCAGT 


TTTATAAAAA 
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CGGTTTTAAA 


CCTCCCGTCC 


GCCGCCGCCd 


720 


CCGGCGGAGG 


CAGCCGTTCG 


GAGGATTATT 


780 


CCAGGCCTCT 


GGCTGCTGAG 


GAGAAGCAGG 


840 


GCAGCAGCCA 


TTACCCGGCT 


GCGGTCCAGA 


900 


CTACCGCCAA 


GTCCAGAGCC 


ATTTCCATCC 


Q60 


TGCCATCTCT 


CTCCTCCTTT 


TTCTTCAGCC 


x. u^. u 


AAGAGATCGT 


TAGCAGAAAC 


AAAAGGAGAT 


i ono 

X W o u 


CCTATATTTA 


TCCAAACATT 


ATTGCTATGG 


1 1 an 


ACAGGAACAA 


TATTGATGAT 


GTAGT A AGGT 




AGATATACAA 


TCTTTGTGCT 


GAAAGACATT 




CACAAT AT C C 


TTTTGAAGAC 


CATAACCCAC* 




AAGAT CTT GA 


CCAATGGCTA 


AGTGAAGATG 


1 "5 fl o 


CTGGAAAGGG 


ACGAACTGGT 


GTAATGATAT 


i a a n 

J. fl ft u 


TAAAGGCACA 


AGAGGCCCTA 


GATTTCTATG 


IjUU 


TAACTATTCC 


CAGTCAGAGG 


C G CT AT GT GT 


IJOU 


TGGATTATAG 


ACCAGTGGCA 


CTGTTRTTTr 

* w X X w X X X V* 




TCAGTGGCGG 


AACTTGCAAT 


CCTCAGTTTG 


X O O \J 


CCTCCAATTC 


AGGACCCACA 


C GAC GGGAAG 


174 0 


CGTTACCTGT 


GTGTGGTGAT 


ATCAAAGTAG 


1800 
1 u u u 


AAAAGGACAA 


AATGTTTCAC 


TTTTGGGTAA 


1860 


CCTCAGAAAA 


AGTAGAAAAT 


GGAAGTCTAT 


1920 


TAGAGCGTGC 


AGATAATGAC 


AAGGAATATC 


1980 


ACAAAGCAAA 


TAAAGACAAA 


GCCAACCGAT 


2040 


ACTT CACAAA 


AACAGTAGAG 


GAGCCGTCAA 


2100 


CACCAGATGT 


TAGT GACAAT 


GAACCTGATC 


2160 


AT C CAGAGAA 


TGAACCTTTT 


GAT G AAGAT C 


2220 


TTTTTTATCA 


AGAGGGATAA 


AACACCAT GA 


2280 


TTTTTTTTTT 


T AAT GGPAAT 


fir* Ti (~* T\ rprn /"-ra 
IVuUAWvi TGT 


2340 


TTTTCCTGAC 


CAATCTTGTT 


TTACCCTATA 


2400 


GTTGAAAAAA 


GGTTGTGTAG 


CTGTGTCATG 


2460 


AAAATTCAAT 


TAGGATTAAT 


AAAGAT GGCA 


2520 


GTGGAGACAG 


ACT GAT GT GT 


AT AC GT AG GA 


2580 
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ATTTTTTCCT TTTGTGTTCT GTCACCAACT GAAGTGGCTA AAGAGCTTTG TGATATACTG 264 0 

GTTCACATCC TACCCCTTTG CACTTGTGGC AACAGATAAG TTTGCAGTTG GCTAAGAGAG 2700 

GTTTCCGAAA GGTTTTGCTA CCATTCTAAT GCATGTATTC GGGTTAGGGC AAT GGAGGGG 2760 

AATGCTCAGA AAGGAAATAA TTTTATGCTG GACTCTGGAC CATATACCAT CT CCAGCTAT 2820 

TTACACACAC CTTTCTTTAG CATGCTACAG TTATTAATCT GGACATTCGA GGAATTGGCC 2880 

GCTGTCACTG CTTGTTGTTT GCGCATTTTT TTTTAAAGCA TATTGGTGCT AGAAAAGGCA 2940 

GCTAAAGGAA GTGAATCTGT ATTGGGGTAC AGGAATGAAC CTTCTGCAAC AT CTTAAGAT 3000 

CCACAAATGA AGGGATATAA AAATAATGTC ATAGGTAAGA AACACAGCAA CAATGACTTA 3060 

ACCATATAAA TGTGGAGGCT ATCAACAAAG AATGGGCTTG AAACATTATA AAAATTGACA 3120 

AT GAT T T ATT AAAT AT GTTT TCTCAATTGT AAAAAAAAAA 3160 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GCGAGGGAGA 


TGAGAGACGG 


CGGCGGCCAC 


GGCCCAGAGC 


CCCTCTCAGC 


GCCTGTGAGC 


60 


AGCCGCGGGG 


GCAGCGCCCT 


CGGGGAGCCG 


GCCGGGCGGC 


GGCGGCGGCA 


GCGGCGGCGG 


120 


GCCTCGCCTC 


CTCGTCGTCT 


GTTCTAACCG 


GGCAGCTTCT 


GAGCAGCTTC 


GGAGAGAGAC 


180 


GGTGGAAGAA 


GCCGTGGGCT 


CGAGCGGGAG 


CCGGCGCAGG 


CTCGGCGGCT 


GCACCTCCCG 


240 


CTCCTGGAGC 


GGGGGGGAGA 


AGCGGCGGCG 


GCGGCCGCGG 


CTCCGGGGAG 


GGGGTCGGAG 


300 


TCGCCTGTCA 


CCATTGCCAG 


GGCT GGGAAC 


GCCGGAGAGT 


TGCTCTCTCC 


CCTTCTCCTG 


360 


CCTCCAACAC 


GGCGGCGGCG 


GCGGCGGCAC 


GTCCAGGGAC 


CCGGGCCGGT 


GTTAAGCCTC 


420 


CCGTCCGCCG 


CCGCCGCACC 


CCCCCTGGCC 


CGGGCTCCGG 


AGGCCGCCGG 


AGGAGGCAGC 


480 


CGCTGCGAGG 


ATTATCCGTC 


TTCTCCCCAT 


TCCGCTGCCT 


CGGCTGCCAG 


GCCTCTGGCT 


540 


GCTGAGGAGA 


AGCAGGCCCA 


GTCTCTGCAA 


CCATCCAGCA 


GCCGCCGCAG 


CAG CCATT AC 


600 


CCGGCTGCGG 


TCCAGGGCCA 


AGCGGCAGCA 


GAGCGAGGGG 


CATCAGCGAC 


CGCCAAGTCC 


660 


AGAGCCATTT 


CCATCCTGCA 


GAAGAAGCCT 


CGCCACCAGC 


AGCTTCTGCC 


ATCTCTCTCC 


720 


TCCTTTTTCT 


TCAGCCACAG 


GCTCCCAGAC 


AT GACAGCCA 


T CAT CAAAG A 


GATCGTTAGC 


780 


AGAAACAAAA 


GGAGATAT CA 


AGAGGAT GGA 


TTCGACTTAG 


ACTTGACCTA 


TATTTATCCA 


840 


AATATTATTG 


CTATGGGATT 


TCCTGCAGAA 


AGACTTGAAG 


GTGTATACAG 


GAACAATATT 


900 


GAT GAT GT AG 


TAAGGTTTTT 


GGATTCAAAG 


CATAAAAACC 


ATTACAAGAT 


ATACAATCTA 


960 



WO 98/33907 PCI7US98/00353 

- 102 - 



TGTGCTGAGA 


GACATTATGA 


CACCGCCAAA 


TTTAACTGCA 


GAGTTGCACA 


GTATCCTTTT? 


1020 


GAAGAC CAT A 


ACCCACCACA 


GCTAGAACTT 


ATCAAACCCT 


TCTGT GAAGA 


TCTTGACCAA 


1080 


TGGCTAAGTG 


AAGATGACAA 


TCATGTTGCA 


GCAATTCACT 


GTAAAGCTGG 


AAAGGGACGG 


1140 


ACTGGTGTAA 


TGATTTGTGC 


ATATTTATTG 


CATCGGGGCA 


AATTTTTAAA 


GGCACAAGAG 


1200 


GCCCTAGATT 


TTTATGGGGA 


AGTAAGGACC 


AGAGACAAAA 


AGGGAGTCAC 


AATTCCCAGT 


1260 


CAGAGGCGCT 


AT GTATATTA 


TTATAGCTAC 


CTGCTAAAAA 


ATCACCTGGA 


TTACAGACCC 


1320 


GTGGCACTGC 


TGTTTCACAA 


GATGATGTTT 


GAAACTATTC 


CAATGTTCAG 


TGGCGGAACT 


1380 


TGCAATCCTC 


AGTTTGTGGT 


CTGCCAGCTA 


AAGGT GAAGA 


TATATTCCTC 


CAATT CAGGA 


1440 


CCCACGCGGC 


GG GAGGACAA 


GTT CAT GT AC 


TTTGAGTTCC 


CTCAGCCATT 


GCCTGTGTGT 


1500 


GGTGATATCA 


AAGTAGAGTT 


CTTCCACAAA 


CAGAACAAGA 


TGCTCAAAAA 


GGACAAAAT G 


1560 


TTTCACTTTT 


GG GTAAAT AC 


GTTCTTCATA 


C CAGGAC CAG 


AGGAAACCTC 


AGAAAAAGTG 


1620 


GAAAATGGAA 


GTCTTTGTGA 


TCAGGAAATC 


GAT AG CATTT 


GCAGTATAGA 


GCGTGCAGAT 


1680 


AATGACAAGG 


AGTATCTTGT 


ACTCACCCTA 


ACAAAAAACG 


ATCTTGACAA AGCAAACAAA 


1740 


GACAAGGCCA 


AC C GATACTT 


CTCTCCAAAT 


TTTAAGGTGA 


AACTATACTT 


TACAAAAACA 


1800 


GTAGAGGAGC 


CATCAAATCC 


AGAGGCTAGC 


AGTTCAACTT 


CTGTGACTCC 


AGATGTTAGT 


1860 


GACAATGAAC 


CTGATCATTA 


TAGATATTCT 


GACACCACTG 


ACTCTGATCC 


AG AG AAT GAA 


1920 


CCTTTTGATG 


AAGATCAGCA 


TACACAAATT 


ACAAAAGTCT 


GA 




1962 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CCGCCCGCCG CCAGGCCCGG GGCCGCCTGC AGCCTGCGGA GGAGGCCGCG CCGCCCGCCG 60 

CTCCTGCCGT CTCTCTCCTC CTTCCTCTCC AGCCACCGGC TCCCAGACAT GACAGCCATC 120 

AT CAAG GAGA TCGTCAGCAG AAACAAAAGG CGCTACCAGG AGGATGGGTT CGACTTGGAC 180 

TTGACCTATA TTTATCCCAA CATTATTGCT ATGGGGTTTC CTGCAGAAAG ACTTGAAGGC 240 

GTATACAGGA ACAATATTGA TGATGTAGTA AGGTTTTTGG ATTCAAAGCA TAAAAACCAT 300 

TACAAGATAT ACAATCTGTG TGCTGAAAGA CATTATGATA CCGCCAAATT TAACTGCAGA 360 

GTTGCACAGT ATCCTTTTGA AGACCATAAT CCACCACAGC TAGAACTTAT CAAACCCTTT 420 

TGTGAAGATC TTGACCAATG GCTAAGTGAA GATGACAATC ATGTTGCAGC AATT CACTGT 480 

AAAGCTGGAA AGGGACGAAC TGGTGTAATG ATTT GTGCAT ATTTATTACA TCGGGGCAAA 54 0 

TTTCTAAAGG CACAAGAGGC CCTAGATTTC TATGGGGAAG TAAGGACCAG AGACAAAAAG 600 
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GGAGTAACTA 


TTCCCAGTCA 


GAGGCGCTAT 


GT GTATT ATT 


ATAGCTACCT 


GTTAAAGAAT 


C C f\ 


CATCTGGATT 


ATAGACCAGT 


GGCACTGTTG 


TTTCACAAGA 


T GAT GTTT GA 


AACTATTCCA 


/ZU 


ATGTTCAGTG 


GCGGAACTTG 


CAATCCTCAG 


TTTGTGGTCT 


GCCAGCTAAA 


GGT GAAGAT C 


Ton 


TATTCCTCCA ATTCAGGACC 


CACACGACGG 


GAAGACAAGT 


TCATGTACTT 


TGAGTTCCCT 


84 0 


CAGCCATTGC 


CTGTGTGCGG 


TGACATCAAA 


GTAGAGTTCT 


TCCACAAACA 


GAACAAGATG 


900 


CTAAAAAAGG 


ACAAAATGTT 


TCACTTTTGG 


GTAAACACAT 


TCTTCATACC 


AGGACCAGAG 


960 


GAAACCTCAG 


AAAAAGTAGA 


AAAT GGAAGT 


CTATGTGATC 


AAGAAATTGA 


TAGTATTTGC 


1020 


AGTATAGAAC 


GTGCAGATAA 


TGACAAGGAA 


TAT CT AGTAC 


TCACTTTAAC 


AAAAAATGAT 


1080 


CTC GACAAAG 


CAAATAAAGA 


CAAGGCCAAC 


CGATATTTTT 


CTCCAAATTT 


TAAGGTGAAG 


1140 


CTGTACTTCA 


CAAAAACTGT 


AGAGGAGCCA 


TCAAACCCGG 


AGGCTAGCAG 


TTCAACTTCT 


1200 


GTGACGCCAG 


ATGTTAGTGA 


CAATGAACCT 


GATCATTATA 


GAT AT T CT GA 


CACCACTGAC 


1260 


TCTGACCCAG 


AGAATGAACC 


CTTTGATGAA 


G 






1291 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 742 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Pro Arg Pro Ala Arg Ser Arg Pro Pro Leu Ala Arg Leu Pro Pro 
1 5 10 15 

Pro Leu Gly Leu Pro Arg Arg Pro Gly Ser Arg Arg Gly Gly Gly Gly 
20 25 30 

Gly Gly Gin Ala Gly Gly Arg Cys Gly Arg Thr Leu Tyr Ala Leu Arg 
35 40 45 

Gin Asp Thr Arg Ser Ala Leu Gly Arg Asp Cys Ala Gin Phe Ser Pro 
50 55 60 

Leu Gly Ser Cys Ser His Asp Gly Ser Leu Arg Val Glu Pro Leu Gly 
65 70 75 80 

Glu Ala Gly Leu Arg Arg Gly Arg Glu Thr Ala Ala Ala Ala Ala Arg 
85 90 95 

Ser Pro Ser Gin Arg Leu Ala Ala Ala Gly Ala Ala Pro Ser Gly Ser 
100 105 110 

Arg Pro Ala Cys Gly Gly Gly Ser Gly Gly Val Ser Arg Leu Leu Phe 
115 120 125 

Val Phe Ser Asn Arg Ala Ala Ser Ser Ser Ala Ser Pro Glu Arg Glu 
130 135 140 

Gly Ser Arg Gly Leu Gly Arg Glu Pro Ala Glu Ala Arg Arg Arg Arg 
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145 "0 155 leo 

Arg Arg His Leu Pro Leu Leu Glu Arg Gly Gly Glu Ala Ala Ala Ala 
165 170 175 

Ala Ala Ala Ala Ala Ala Ala Pro Gly Arg Gly Ser Glu Ser Pro Val 
180 185 190 

Thr lie Ser Arg Ala Gly Asn Ala Gly Glu Leu Val Ser Pro Leu Leu 
195 200 205 

Leu Pro Pro Thr Arg Arg Arg Arg Arg Arg His lie Gin Gly Pro Gly 
210 215 220 

Pro Val Leu Asn Leu Pro Ser Ala Ala Ala Ala Pro Pro Val Ala Arg 
225 230 235 240 

Ala Pro Glu Ala Ala Gly Gly Gly Ser Arg Ser Glu Asp Tyr Ser Ser 
245 250 255 

Ser Pro His Ser Ala Ala Ala Ala Ala Arg Pro Leu Ala Ala Glu Glu 
260 265 270 

Lys Gin Ala Gin Ser Leu Gin Pro Ser Ser Ser Arg Arg Ser Ser His 
275 280 285 

^ Ho Val Gln Hi Gln Ma Ala Ma G * u ^9 Gly Ala Ser 



300 



Ala Thr Ala Lys Ser Arg Ala He Ser He Leu Gln Lys Lys Pro Arg 

JU:5 310 315 320 

His Gln Gln Leu Leu Pro Ser Leu Ser Ser Phe Phe Phe Ser His Aro 

325 330 335 

Leu Pro Asp Met Thr Ala lie He Lys Glu He Val Ser Arg Asn Lys 

40 345 350 

Arg Arg Tyr Gln Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr He Tyr 

•" 5 360 365 

Pro Asn He He Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly Val 

370 375 380 x 

Tyr Arg Asn Asn He Asp Asp Val Val Arg Phe Leu Asp Ser Lys His 

J85 390 395 400 

Lys Asn His Tyr Lys He Tyr Asn Leu Cys Ala Glu Arg His Tyr Asp 

405 410 415 

Thr Ala Lys Phe Asn Cys Arg Val Ala Gln Tyr Pro Phe Glu Asp His 

420 425 430 

Asn Pro Pro Gln Leu Glu Leu He Lys Pro Phe Cys Glu Asp Leu Asp 

4J;> 440 445 

Gln Trp Leu Ser Glu Asp Asp Asn His Val Ala Ala He His Cys Lys 

455 46O 

Ala Gly Lys Gly Arg Thr Gly Val Met He Cys Ala Tyr Leu Leu His 

470 475 48O 

Arg Gly Lys Phe Leu Lys Ala Gln Glu Ala Leu Asp Phe Tyr Gly Glu 

485 490 495 
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Val Arg Thr Arg Asp Lys Lys Gly Val Thr lie Pro Ser Gin Arg Arg 
500 505 510 

Tyr Val Tyr Tyr Tyr Ser Tyr Leu Leu Lys Asn His Leu Asp Tyr Arg 
515 520 525 

Pro Val Ala Leu Leu Phe His Lys Met Met Phe Glu Thr lie Pro Met 
530 535 540 

Phe Ser Gly Gly Thr Cys Asn Pro Gin Phe Val Val Cys Gin Leu Lys 
545 550 555 560 

Val Lys lie Tyr Ser Ser Asn Ser Gly Pro Thr Arg Arg Glu Asp Lys 
565 570 575 

Phe Met Tyr Phe Glu Phe Pro Gin Pro Leu Pro Val Cys Gly Asp lie 
580 585 590 

Lys Val Glu Phe Phe His Lys Gin Asn Lys Met Leu Lys Lys Asp Lys 
595 600 605 

Met Phe His Phe Trp Val Asn Thr Phe Phe lie Pro Gly Pro Glu Glu 
610 615 620 

Thr Ser Glu Lys Val Glu Asn Gly Ser Leu Cys Asp Gin Glu lie Asp 
625 630 635 640 

Ser lie Cys Ser lie Glu Arg Ala Asp Asn Asp Lys Glu Tyr Leu Val 
645 650 655 

Leu Thr Leu Thr Lys Asn Asp Leu Asp Lys Ala Asn Lys Asp Lys Ala 
660 665 670 

Asn Arg Tyr Phe Ser Pro Asn Phe Lys Val Lys Leu Tyr Phe Thr Lys 
675 680 685 

Thr Val Glu Glu Pro Ser Asn Pro Glu Ala Ser Ser Ser Thr Ser Val 
690 695 700 

Thr Pro Asp Val Ser Asp Asn Glu Pro Asp His Tyr Arg Tyr Ser Asp 
705 710 715 720 

Thr Thr Asp Ser Asp Pro Glu Asn Glu Pro Phe Asp Glu Asp Gin His 
725 730 735 

Thr Gin lie Thr Lys Val 
740 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Arg Glu Thr Ala Ala Ala Thr Ala Gin Ser Pro Ser Gin Arg Leu Ala 
15 10 15 

Ala Ala Gly Ala Ala Pro Ser Gly Ser Arg Pro Gly Gly Gly Gly Gly 
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20 25 30 

Ser Gly Gly Gly Pro Arg Leu Leu Val Val Cys Ser Asn Arg Ala Ala 
35 40 45 

Ser Glu Gin Glu Arg Asp Gly Gly Arg Ser Arg Gly Leu Glu Arg Glu 
3U 55 60 

Pro Ala Gin Ala Arg Arg Leu His Leu Pro Leu Leu Glu Arg Gly Gly 
65 70 75 so 

Glu Ala Ala Ala Ala Ala Pro Gly Arg Gly Ser Glu Ser Pro Val Thr 
85 go 95 

He Ala Arg Ala Gly Asn Ala Gly Glu Leu Leu Ser Pro Leu Leu Leu 
100 105 110 

Pro Pro Thr Arg Arg Arg Arg Arg Arg His Val Gin Gly Pro Gly Pro 
115 120 12 5 

Val Leu Ser Leu Pro Ser Ala Ala Ala Ala Pro Pro Leu Ala Arg Ala 
1JU 135 140 

Pro Glu Ala Ala Gly Gly Gly Ser Arg Cys Glu Asp Tyr Pro Ser Ser 
145 150 155 160 

Pro His Ser Ala Ala Ser Ala Ala Arg Pro Leu Ala Ala Glu Glu Lys 
165 170 1?5 

Gin Ala Gin Ser Leu Gin Pro Ser Ser Ser Arg Arg Ser Ser His Tyr 
180 10 5 190 

Pro Ala Ala Val Gin Gly Gin Ala Ala Ala Glu Arg Gly Ala Ser Ala 
195 200 205 

Thr Ala Lys Ser Arg Ala lie Ser lie Leu Gin Lys Lys Pro Arg His 

215 220 

Gin Gin Leu Leu Pro Ser Leu Ser Ser Phe Phe Phe Ser His Arg Leu 

230 235 240 

Pro Asp Met Thr Ala lie He Lys Glu He Val Ser Arg Asn Lys Arg 
245 250 255 

Arg Tyr Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr lie Tyr Pro 
260 265 270 

Asn He lie Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly Val Tyr 

d/D 280 



285 



Arg Asn Asn lie Asp Asp Val Val Arg Phe Leu Asp Ser Lys His Lys 
" u 295 300 

Asn His Tyr Lys He Tyr Asn Leu Cys Ala Glu Arg His Tyr Asp Thr 
JW> 310 315 320 

Ala Lys Phe Asn Cys Arg Val Ala Gin Tyr Pro Phe Glu Asp His Asn 
325 330 335 

Pro Pro Gin Leu Glu Leu He Lys Pro Phe Cys Glu Asp Leu Asp Gin 

345 



350 



Trp Leu Ser Glu Asp Asp Asn His Val Ala Ala He His Cys Lys Ala 

360 



365 
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Gly Lys Gly Arg Thr Gly Val Met lie Cys Ala Tyr Leu Leu His Arg 
370 375 380 

Gly Lys Phe Leu Lys Ala Gin Glu Ala Leu Asp Phe Tyr Gly Glu Val 
385 390 395 400 

Arg Thr Arg Asp Lys Lys Gly Val Thr lie Pro Ser Gin Arg Arg Tyr 
405 410 415 

Val Tyr Tyr Tyr Ser Tyr Leu Leu Lys Asn His Leu Asp Tyr Arg Pro 
420 425 430 

Val Ala Leu Leu Phe His Lys Met Met Phe Glu Thr lie Pro Met Phe 
435 440 445 

Ser Gly Gly Thr Cys Asn Pro Gin Phe Val Val Cys Gin Leu Lys Val 
450 455 460 

Lys lie Tyr Ser Ser Asn Ser Gly Pro Thr Arg Arg Glu Asp Lys Phe 
465 470 475 480 

Met Tyr Phe Glu Phe Pro Gin Pro Leu Pro Val Cys Gly Asp lie Lys 
485 490 495 

Val Glu Phe Phe His Lys Gin Asn Lys Met Leu Lys Lys Asp Lys Met 
500 505 510 

Phe His Phe Trp Val Asn Thr Phe Phe lie Pro Gly Pro Glu Glu Thr 
515 520 525 

Ser Glu Lys Val Glu Asn Gly Ser Leu Cys Asp Gin Glu lie Asp Ser 
530 535 540 

lie Cys Ser lie Glu Arg Ala Asp Asn Asp Lys Glu Tyr Leu Val Leu 
545 550 555 560 

Thr Leu Thr Lys Asn Asp Leu Asp Lys Ala Asn Lys Asp Lys Ala Asn 
565 570 575 

Arg Tyr Phe Ser Pro Asn Phe Lys Val Lys Leu Tyr Phe Thr Lys Thr 
580 585 590 

Val Glu Glu Pro Ser Asn Pro Glu Ala Ser Ser Ser Thr Ser Val Thr 
595 600 605 

Pro Asp Val Ser Asp Asn Glu Pro Asp His Tyr Arg Tyr Ser Asp Thr 
610 615 620 

Thr Asp Ser Asp Pro Glu Asn Glu Pro Phe Asp Glu Asp Gin His Ser 
625 630 635 640 

Gin lie Thr Lys Val 
645 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Pro Ala Ala Arg Pro Gly Ala Ala Cys Ser Leu Arg Arg Arg Pro 
15 10 15 

Arg Arg Pro Pro Leu Leu Pro Ser Leu Ser Ser Phe Leu Ser Ser His 
20 25 30 

Arg Leu Pro Asp Met Thr Ala lie lie Lys Glu lie Val Ser Arg Asn 
35 40 45 

Lys Arg Arg Tyr Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr lie 
50 55 60 

Tyr Pro Asn lie lie Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly 
65 70 75 80 

Val Tyr Arg Asn Asn lie Asp Asp Val Val Arg Phe Leu Asp Ser Lys 
85 90 95 

His Lys Asn His Tyr Lys lie Tyr Asn Leu Cys Ala Glu Arg His Tyr 
100 105 no 

Asp Thr Ala Lys Phe Asn Cys Arg Val Ala Gin Tyr Pro Phe Glu Asp 
115 120 125 

His Asn Pro Pro Gin Leu Glu Leu lie Lys Pro Phe Cys Glu Asp Leu 
130 135 140 

Asp Gin Trp Leu Ser Glu Asp Asp Asn His Val Ala Ala He His Cys 
145 150 155 160 

Lys Ala Gly Lys Gly Arg Thr Gly Val Met lie Cys Ala Tyr Leu Leu 
165 170 175 

His Arg Gly Lys Phe Leu Lys Ala Gin Glu Ala Leu Asp Phe Tyr Gly 
180 185 190 

Glu Val Arg Thr Arg Asp Lys Lys Gly Val Thr He Pro Ser Gin Arg 
195 200 205 

Arg Tyr Val Tyr Tyr Tyr Ser Tyr Leu Leu Lys Asn His Leu Asp Tyr 
210 215 220 

Arg Pro Val Ala Leu Leu Phe His Lys Met Met Phe Glu Thr He Pro 
225 230 235 240 

Met Phe Ser Gly Gly Thr Cys Asn Pro Gin Phe Val Val Cys Gin Leu 
245 250 255 

Lys Val Lys He Tyr Ser Ser Asn Ser Gly Pro Thr Arg Arg Glu Asp 
260 265 270 

Lys Phe Met Tyr Phe Glu Phe Pro Gin Pro Leu Pro Val Cys Gly Asp 
275 280 285 

He Lys Val Glu Phe Phe His Lys- Gin Asn Lys Met Leu Lys Lys Asp 
290 295 300 

Lys Met Phe His Phe Trp Val Asn Thr Phe Phe He Pro Gly Pro Glu 
305 310 315 320 

Glu Thr Ser Glu Lys Val Glu Asn Gly Ser Leu Cys Asp Gin Glu He 
325 330 335 



BNSDOCIO <WO 9833907A1 I > 



WO 98/33907 PCTAJS98/I 

-109- 

Asp Ser lie Cys Ser lie Glu Arg Ala Asp Asn Asp Lys Glu Tyr Leu 

340 345 350 

Val Leu Thr Leu Thr Lys Asn Asp Leu Asp Lys Ala Asn Lys Asp Lys 
355 360 365 

Ala Asn Arg Tyr Phe Ser Pro Asn Phe Lys Val Lys Leu Tyr Phe Thr 
370 375 380 

Lys Thr Val Glu Glu Pro Ser Asn Pro Glu Ala Ser Ser Ser Thr Ser 
385 390 395 400 

Val Thr Pro Asp Val Ser Asp Asn Glu Pro Asp His Tyr Arg Tyr Ser 
405 410 415 

Asp Thr Thr Asp Ser Asp Pro Glu Asn Glu Pro Phe Asp Glu 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCTCCCCTCG CCCGGCGCGG TCCCGTCCGC CTCTCGCTCG CCTCCCGCCT CCCCTCGGTC 60 

TTCCGAGGCG CCCGGGCTCC CGGCGCGGCG GCGGAGGGGG CGGGCAGGCC GGCGGGCGGT 120 

GATGTGGCAG GACTCTTTAT GCGCTGCGGC AGGATACGCG CTCGGCGCTG GGACGCGACT 18 0 

GCGCTCAGTT CTCTCCTCTC GGAAGCTGCA GCCATGATGG AAGTTTGAGA GTTGAGCCGC 24 0 

TGTGAGGCGA GGCCGGGCTC AGGCGAGGGA GATGAGAGAC GGCGGCGGCC GCGGCCCGGA 300 

GCCCCTCTCA GCGCCTGTGA GCAGCCGCGG GGGCAGCGCC CTCGGGGAGC CGGCCGGCCT 360 

GCGGCGGCGG CAGCGGCGGC GTTTCTCGCC TCCTCTTCGT CTTTTCTAAC CGTGCAGCCT 420 

CTTCCTCGGC TTCTCCTGAA AGGGAAGGTG GAAGCCGTGG GCTCGGGCGG GAGCCGGCTG 480 

AGGCGCGGCG GCGGCGGCGG CGGCACCTCC CGCTCCTGGA GCGGGGGGGA GAAGCGGCGG 54 0 

CGGCGGCGGC CGCGGCGGCT GCAGCTCCAG GGAGGGGGTC TGAGTCGCCT GTCACCATTT 600 

CCAGGGCTGG GAACGCCGGA GAGTTGGTCT CTCCCCTTCT ACTGCCTCCA ACACGGCGGC 660 

GGCGGCGGCG GCACATCCAG GGACCCGGGC CGGTTTTAAA CCTCCCGTCC GCCGCCGCCG 720 

CACCCCCCGT GGCCCGGGCT CCGGAGGCCG CCGGCGGAGG CAGCCGTTCG GAGGATTATT 78 0 

CGTCTTCTCC CCATTCCGCT GCCGCCGCTG CCAGGCCTCT GGCTGCTGAG GAGAAGCAGG 840 

CCCAGTCGCT GCAACCATCC AGCAGCCGCC GCAGCAGCCA TTACCCGGCT GCGGTCCAGA 900 

GCCAAGCGGC GGCAGAGCGA GGGGCATCAG CTACCGCCAA GTCCAGAGCC ATTTCCATCC 960 

TGCAGAAGAA GCCCCGCCAC CAGCAGCTTC TGCCATCTCT CTCCTCCTTT TTCTTCAGCC 1020 
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ACAGGCTCCC AGACAT GAGA GCCATCATCA AAGAGATCGT TAGCAGAAAC AAAAGGAGATr 1080 

ATCAAGAGGA TGGATTCGAC TTAGACTTGA CCTGTATCCA TTTCTGCGGC TGCTCCTCTT 1140 

TACCTTTCTG TCACTCTCTT AGAACGTGGG AGTAGACGGA TGCGAAAATG TCCGTAGTTT 1200 

GGGTGACTAT AACAT TTAAC CCTGGTCAGG TTGCTAGGTC ATATATTTTG TGTTTCC 1257 

(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAGACATAGC CAGCTCTTAA ATCTGACTTC CAGATTTTCA CTGTGTCTTC TTTTTTCTGT 60 

AACGTGTTGC CTTTTTTAGC CATGAAAAAT TAGAAGTTGA ACTCTTGTCT TTTCAGGCAG 120 

GTGTCAATTT TGGGGTTTTG TTTTGATTTT TGGTTTTTGA CATAAAGTAC TTTAGTTCTG 180 

TGATGTATAA ACCGTGAGTT TCTGTTTTTC T CAT AT AC CT GAATACTGTC CATGTGGAAG 240 

TTACCTTTTA TCTTTACCAG TATTAACACA TAAAT GGTTA TACATAAATA CATTGACCAC 300 

CTTTTATTAC TCCAGCTATA GTGGGGAAAG CTTTCTTTTC ATAACTAGCT AATGTTTTAA 360 

AAAGTATTCT TTTAGTTTGA TTGCTGCATA TTT CAGAT AT TTCTTTCCTT AACTAAAGTA 420 

CTCAGATATT TATCCAAACA TTATTGCTAT GGGATTTCCT GCAGAAAGAC TTGAAGGCGT 480 

ATACAGGAAC AATATTGATG AT GTAGTAAG GTAAGAATGC TTTGATTTTC TATTT CAAAT 54 0 

ATTGATGTTT AT ATT CAT GT TGTGTTTTCA TTTAGAAAAG ATTTCTAAGC CACAGAAAAA 600 

GATACTTTGT GAT GTAAACT ATTATT GTAG TGCTCTATAA TCATTTTTTG GCTTACCGTA 660 

CCTAATGGAC TTCAGGGGGA TACAGTTCAT TTGATAAGAA CTGACCTTAT ACATTACATA 720 

ATCAGGTACT TAT GT GATAT CATTTCCTGG ACT C CATAAA ATGCTGGTCA CCAGGTTTAA 780 

TACCTGGATT CCATTACAGT GT GATTTTT G TCTTATTTCA TAGTTGGGGA TTAGGCTTAA 84 0 

AATCCTAGAG TGGATTTATT CAGTTAAATT TATTCACACT AAGAT GT GAT GACTAATACT 900 

GTATATTTTT AT GTAGACCA AATTTTAAGG TACCACTGTG CATAT GTTAC CAACTACCTG 960 

AAGAATATTT GGTTGGTACA GAATATATAA AGGAATCGCT GGTGTTCCAA GGCTAATCCA 1020 

GTTTTATAAT TTTGCATAAT TTCCTAACTG CGAATATCAT TTATTTAAAC AATTTATTCT 1080 
CCAG 



(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



1084 
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(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAATTAATAG TTAGTACGTG GATCTTTCAA AT AT CAAAAG TTTTCAGTTT GAT G GGAAAA 60 

TGATGTCTGA ATTTTCAGGG TTATTTTTAA GAGTACTTGA TTATGACTGT CTTGTAAATC 120 

TCTATGAGCT AGGTATACTT GCACTAAATG CTAATGCTTT TTAAAGAAGT TATGTCTTAA 180 

TATTCAGTCT CATTATGTTA GGTTGAAGAT AGAAGATTAT GAAAATATTC TCTGAAAAGC 240 

TCTGGTTTTA CTT CAGATTG TATAAATCTG TGTAATGTAA TAATTATTTA AGAATGACAT 300 

GATTACTACT CTAAAC CCAT AGAAGGGGTA TTTGTTGGAT TATTTATTTT CACTTAAATG 360 

GTATTT GAGA TT AG GAAAAA GAAAATCTGT CTTTTGGTTT TT CTT GAT AG TATTAATGTA 420 

ATTTCAAATG TTAGCTCATT TTTGTTAATG GTGGCTTTTT GTTTGTTTGT TTTGTTTTAA 48 0 

GGTTTTTGGA TTCAAAGCAT AAAAACCATT ACAAGATATA CAATCTGTAA GTATGTTTTC 54 0 

TT AT TTGTAT GCTTGCAAAT AT CTT CT AAA ACAACTATTA AGT GAAAGTT ATCTGCTTGT 600 

T AGAGT GAG G TAGAGTTAAA GATACATTTT AACAGAATTG TATT CCTAAA CCGATTAAGT 660 

CAAGAAGTCC AAGAGCATTG TTAGATCATT TAGAAAGTGT AGT GAT GAGG TAAAACATTG 720 

TTGGCACAGA TTCATGTTAC TTGATCTGCT TTAAATGACT TGGCATCTAG CCCATATTTG 7 80 

AGCC CATAAC CGTGTGGTAA TTTGAAGTGT AATT CACAGT AGAGCTTCTG TTAAAGCACT 84 0 

AATAGCAT CT TCCATGGAGG TATACTT GAG AGT GAAT AT A ATTTTGTTTA TCCTGTGTCT 900 

CTAGAG CTAT TGACTGAAAA AGCTGTTAGG GCATTCTCTA ACTGTACATC ACCTAAGTTA 960 

TTTAAAATTG CTGAATTAAG TGGCTTGTCT TGTCTAGACA GATTTTAAGG ACTGCCCACC 1020 

T GAT T GAT AG AACTAGTTGA CCTTATCTTT AACTTTTTGT TTTCTTTTGA CTTGGGATAA 108 0 

AAGTTGAAAA GGTAAAAGGA AGGA 1104 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TTGCATACAC TTAATCTTTT AAGCTTTGGT TTTATTATTA TAATATGGGG GTGATAACAG 60 

TATCTACTTA AT AGAATT CT TGTTATTAAC AT GAAAT AAT TAATGTTAAA CACAGCATAA 120 

TATGTGTCAC ATTATAAAGA TTCAGGCAAT GTTTGTTAGT ATTAGTACTT TTTTTTCTTC 180 

CTAAGTGCAA AAGATAACTT TATATCACTT TTAAACTTTT CTTTTAGTTG TGCTGAAAGA 240 

CATTATGACA CCGCCAAATT TAATT GCAGA GGTAGGTATG AATGTACTGT ACTATGTTGT 300 

ATAACTTAAA CCCGATAGAC TGTATCTTAC TGTCATAACA ATAATGAGTC ATCCAGATTA 360 
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T CGAGT GAGA TACATATTTA TCTTAAGAAT TAT CTTTAAA AATTT CAAAA ATTTTAATTT 
TACTGTTGTG TTTTAGGAAA AAGTATTGCA TAAAGCTATT AATATTGTCA GGAAGACTAA 
AGT GCAGCAT AGACTAAGCA ATCAGGAAAA TTCCTAGACT AAAAATAGTA TAAGGAGAGG 
GTTTACCTAC TATTTGAGGC AGTTGGTCTA ATAGTAAGCA ATCACAGGGA GGAAAGCAGA 
AACTACTTAA CTCTTCTGTG TTGAGGAATG ACATAAAAGG TAT GAAAGGA TATAAC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: modif ied_base 
<B) LOCATION: 4 63. .754 

(D) OTHER INFORMATION :/note= M N « C, G, A or T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ATACATTATT 


TTTCTCTGGA 


AT CGAGT GTT 


TCTTTTAAAT 


ACCTGTTAAG 


TTTGTATGCA 


60 


ACATTT CTAA 


AGTTACCTAC 


TTGTTAATTA 


AAAATTCAAG 


GGTTTTTTTT 


TCTTATTCTG 


120 


AGGTTAT CTT 


TTTACCACAG 


TTGCACAATA 


TCCTTTTGAA 


GACCATAACC 


CACCACAGCT 


180 


AGAACTTATC 


AAACCCTTTT 


GTGAAGATCT 


TGACCAATGG 


CTAAGT GAAG 


AT GACAAT CA 


240 


TGTTGCAGCA 


ATTCACTGTA 


AAGCT GGAAA 


GGGACGAACT 


GGTGTAATGA 


TAT GTGCATA 


300 


TTTATTACAT 


CGGGG CAAAT 


TTTTAAAGGC 


ACAAGAGGCC 


CTAGATTTCT 


ATGGGGAAGT 


360 


AAGGACCAGA 


GACAAAAAGG 


TAAGTTATTT 


TTTGATGTTT 


TTCCTTTCCT 


CTTCCTGGAT 


420 


CTGAGAATTT 


ATTGGAAAAC 


AGATTTTGGG 


TTTCTTTTTT 


TCNNNNNNNN 


NNNNNNNNNN 


480 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


540 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


600 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


660 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


720 


NNNNNNNNNN 


NTCCTCCCTC 


CCCACCCTCA 


GTCNCTGGAA AACAGGTTTT 


AAAGATAGTT 


780 


GCTAATCCTT 


ATTTCTTCTA 


AATTTTTA 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AT AT GAT AAT TGTTTTAAGG GAGGAGAGTT ATTCTGATAT CCTTGTATTG ATATTGCTCT 60 

TATTTATTAT TGAGCTGGAT TTAAGTATTA AT CATTTAAG GTCAAATTTC TAAT GTATAA 120 

TATGTTCTTA AATGGCTACG ACCCAGTTAC CATAGCAATT T AGT GAAAT A ACT AT AAT GG 180 

AACATTTTTT TTCAATTT GG CTTCTCTTTT TTTTCTGTCC ACCAGGGAGT AACTATT CCC 24 0 

AGTCAGAGGC GCTATGTGTA TTATTATAGC TACCTGTTAA AGAAT CAT CT GGATTATAGA 300 

CCAGTGGCAC TGTTGTTTCA CAAGAT GAT G TTT GAAACTA TTCCAATGTT CAGTGGCGGA 360 

ACTTGCAGTA AGTGCTTGGA AATT CTCAT C CTTCCATGTA TTGGAACAGT TTTCTTAACC 420 

ATATCTAGAA GTTTACATAA AAATTTAGAA AAGAAATTTA CCACATTTGA AATTTATGCA 4 80 

GGAGACTATA TTTCTGAAGC ATT T GAACAA ATTAATTAGC TTTGTTGTTC AACTCATTGG 540 

GCTAAAGAAG CCAAAAGCAA TGGGTTTTAA TGTAGTCGAA GCCAAATTAT ATTTAT GAAA 600 

GAAATATTCT GTGTTATAAC CCAC CAAAT A CAGCCCAATT TCT GACTAGA TGTATGGAAG 660 

AACCTGTCCC 670 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATATTTTTAT TTCATTTATT TCAGTTGATT TGCTTGAGAT CAAGATTGCA GATACAGAAT 60 

CCATATTTCG TGTATATTGC TGATATTAAT CATTAAAATC GTTTTTGACA GTTTGACAGT 120 

TAAAGGCATT TCCTGTGAAA TAATACT GGT ATGTATTTAA CCAT GCAGAT CCTCAGTTTG 180 

TGGTCTGCCA GCTAAAGGTG AAGATATATT CCTCCAATTC AGGACCCACA CGACGGGAAG 24 0 

ACAAGTT CAT GTACTTTGAG TTCCCTCAGC CGTTACCTGT GTGTGGTGAT AT CAAAGT AG 300 

AGTTCTTCCA CAAACAGAAC AAGAT GCTAA AAAAGGTTTG TACTTTACTT TCATTGGGAG 360 

AAATAT CCAA AATAAGGACA GATTAAAAGC TATATTTTAT TTTATGACAT GTAAGGAACT 420 

ATAATTTGTT TTCTATTAGA TCTGCAGGTG TTTTGCTTAC TCTGGCATTG GTGAGACATT 480 

ATAAGGGTAA ATAATCCTGT TTGAAGGAAA AGGCCTTATG GCATTGTAAC ATTAGAGGAA 540 

TTTTTCTTAA CAAGGATGGT TAACTGAGAA GAAAT TAG CA T GGGAC CAAT ATTTTAAAAA 600 

TTTTTGGTCT ATAGGTAGAA ATGAGATCTG TTCTGTGGTC TTATGTAGTG ACACAAACCA 660 

C 661 



(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: = 

(A) LENGTH: 739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GTGTTCACCT TTATTCAGAA TATCAAATGA TAGTTTATTT TGTTGACTTT TTGCAAATGT 60 

TTAACATAGG TGACAGATTT TCTTTTTTAA AAAAATAAAA CATCATTAAT TAAATATGTC 120 

ATTTCATTTC TTTTTCTTTT CTTTTTTTTT TTTTTTTAGG ACAAAATGTT TCACTTTTGG 180 

GTAAATACAT TCTTCATACC AGGACCAGAG GAAACCTCAG AAAAAGTAGA AAAT GGAAGT 240 

CTATGTGATC AAGAAATCGA TAGCATTTGC AGTATAGAGC GTGCAGATAA TGACAAGGAA 300 

TAT CTAGTAC TTACTTTAAC AAAAAATGAT CTT GACAAAG CAAATAAAGA CAAAGCCAAC 360 

CGATACTTTT CTCCAAATTT TAAGGTCAGT TAAATTAAAC ATTTTGTGGG GGTTGGTGAC 420 

TTGTATGTAT GTGATGTGTG TTTAATTCTA GGAGTACAGC TGATGAAGAA CTTGCTTGAC 480 

AAGTTTTTAA CTTAT GTATT ATTTCGAAGC AGT GTTTACG TAGCAGTAAC ATGAAAGTTT 540 

CTAATAAAAT ACCCAATGTA CACAGCGTCA AAAAAGCTGC ATTTTTCCTT TTCCTAATTC 600 

TTTGTTGTTT GCTGAAATCT GGGGCAAAGG TGCGGGAGGG GGCTAAATGA CTGGGATATG 660 

AAGTAGGAAT GGGAGAGGAA AGAAATAGAT GGGAACTCAG TCATTTGGGA ATGATTCATA 720 

TGGAATGTTT TTACTGCTT 739 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGAGCCAAG ATCATGCCAC TGCACTCCAG CTTGGCAACA GAGCAAGACT CTTGTCTCCA 60 

GAAATAGAAA AT AAAT AAAT TGTATTAACA TCCTGATAGT TTATCTGTCT AGTACCTAGC 120 

AAGAAAGAAA ATGTTGAACA TCTTAAGAAG AGGGTCATTT AAAAGGCCTC TTAAAAGATC 180 

AT GTTTGTT A CAGTGCTTAA AAATTAATAT GTT CAT CTGC AAAATGGAAT AAAAAATCTG 240 

TTAAAAATAT ATTTCACTAA ATAGTTTAAG AT GAGT CAT A TTTGTGGGTT TT CATTTTAA 300 

ATTTTCTTTC TCTAGGTGAA GCTGTACTTC ACAAAAACAG TAGAGGAGCC GT CAAATCCA 360 

GAGGCTAGCA GTTCAACTTC TGTAACACCA GATGTTAGTG ACAATGAACC TGAT CATTAT 420 

AGATATTCTG ACACCACTGA CTCTGATCCA GAGAATGAAC CTTTTGATGA AGATCAGCAT 480 

ACACAAATTA CAAAAGTCTG AATTTTTTTT TATCAAGAGG GATAAAACAC CAT GAAAAT A 540 

AACTTGAATA AACTGAAAAT GGACCTTTTT TTTTTTAATG GCAATAGGAC ATTGTGTCAG 600 
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ATTACGAGTT 


J\ 1 >VVjoM/\w\rV 


1 1 tl tl 1 1 1^ 


CTGACCAATC 


TTGTTTTACC 


CTAT ACAT CC 


660 


TV f TV r"/™f , l l, P f P , P 

AUAUlaCa 1111 




O J. Uw\VJ X X 


AAAAAGGTTG 


TGTAGCTGTG 


T CAT GT AT AT 


720 


ACCTTTTTGT 


GTCAAAAGGA 


CATTTAAAAT 


TCAATTAGGA 


TTAATAAAGA 


TGGCACTTTC 


780 


CCGTTTTATT 


CCAGTTTTAT 


AAAAAGTGGA 


GACAGACTGA 


TGTGTATACG 


TAGGAATTTT 


840 


TTCCTTTTGT 


GTTCTGTCAC 


CAACTGAAGT 


GGCTAAAGAG 


CTTTGTGATA 


TACTGGTTCA 


900 


CATCCTACCC 


CTTTGCACTT 


GTGGCAACAG 


ATAAGTTTGC 


AGTTGGCTAA 


GAGAGGTTTC 


960 


CGAAAGGTTT 












970 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

lie His Cys Lys Ala Gly Lys Gly Arg Thr Gly 
15 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: Modified- site 

(B) LOCATION : 1 

(D) OTHER INFORMATION : / note= "may be either I = 
Isoleucine or V = Valine" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATIONS. .8 

(D) OTHER INFORMATION: /note= "X = Any amino acid" 

(ix) FEATURE: 

(A) NAME/ KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "may be either S = Serine or 
T « Threonine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

lie His Cys Xaa Ala Gly Xaa Xaa Arg Ser Gly 
15 10 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Thr Ala lie lie Lys Glu He Val Ser Arg Asn Lys Arg Arg Tyr 
1 5 10 - * 



15 



Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr He Tyr Pro Asn He 
20 25 3 0 

He Ala Met Gly Phe Pro Ala Glu Arg Asn Glu Gly Val Tyr Arg Asn 



45 

Asn He Asp Asp Val Val Arg Phe Leu Asp Ser Lys 
50 55 6 y Q 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Thr Ala He lie Lys Glu He Val Ser Arg Asn Lys Arg Arg Tyr 

10 15 

Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr He Tyr Pro Asn lie 
20 25 30 

He Ala Met Gly Phe Pro Ala Glu Arg Leu Glu Gly Val Tyr Arg Asn 



45 



35 40 

Asn He Asp Asp Val Val Arg Phe Leu Asp Ser Lvs 
50 55 6u 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Thr Ala He lie Lys Glu lie Val Ser Arg Asn Lys Arg Arg Tyr 

Gin Glu Asp Gly Phe Asp Leu Asp Leu Thr Tyr He Tyr Pro Asn He 
20 25 3 0 

He Ala Met Glu Phe Pro Ala Glu Arg Leu Glu Gly Val Tyr Arg Asn 
35 40 45 

Asn lie Asp Asp Val Val Arg Phe Leu Asp Ser Lvs 
50 55 6 y 0 
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CLAIMS 

1. An isolated polypeptide encoding a tumor suppressor designated as TS10q23.3. 

2. The polypeptide of claim 1, wherein the tumor suppressor has the amino acid sequence 
as set forth in FIG. 7. 

3. The polypeptide of claim 1, wherein the tumor suppressor has an amino acid sequence as 
set forth in FIG. 9. 

4. An isolated peptide having between about 10 and about 50 consecutive residues of a 
tumor suppressor designated as TS 1 0q23 .3 . 

5. The peptide of claim 4, wherein the peptide is conjugated to a carrier molecule. 

6. The peptide of claim 5, wherein the carrier molecule is selected from the group 
consisting of KLH and BSA. 

7. The peptide of claim 4, wherein the tumor suppressor has the amino acid sequence as set 
forth in FIG. 7. 

8. The peptide of claim 4, wherein the tumor suppressor has an amino acid sequence as set 
forth in FIG. 9. 

9. A monoclonal antibody that binds immunologically to a tumor suppressor designated as 
TS10q23.3. 

10. The monoclonal antibody of claim 9, wherein the antibody does not bind 
immunologically to other human polypeptides. 

11. The monoclonal antibody of claim 9, wherein the antibody binds to a non-human 
TS10q23.3 and does not bind to human TS10q23.3. 



WO 98/33907 PCT/US98/00353 

- 118 - 

12. The monoclonal antibody of claim 9, wherein the antibody further comprises a detectable 
label. 



13. The monoclonal antibody of claim 12, wherein the label is selected from the group 
5 consisting of a fluorescent label, a chemiluminescent label, a radiolabel and an enzyme. 

14. A hybridoma cell that produces a monoclonal antibody that binds immunologically to a 
tumor suppressor designated as TS10q23.3. 

10 15. The hybridoma cell of claim 14, wherein the antibody does not bind immunologically to 
other human polypeptides. 

16. The hybridoma cell of claim 14, wherein the antibody binds to a non-human TS10q23.3 
and does not bind to human TS 1 0q23 .3 . 

15 

17. A polyclonal antisera, antibodies of which bind immunologically to a tumor suppressor 
designated as TS10q23.3. 

18. The antisera of claim 17, wherein said antisera is derived from an animal other than 
20 human, mouse or dog. 

19. An isolated nucleic acid comprising a region, or the complement thereof, encoding a 
tumor suppressor designated TS10q23.3 or an allelic variant or mutant thereof. 

25 20. The nucleic acid of claim 19, wherein the tumor suppressor coding region is selected 
from the group consisting of murine, canine and human. 

21. The nucleic acid of claim 20, wherein the tumor suppressor coding region is human. 

30 22. The nucleic acid of claim 19, wherein the mutant is a deletion mutant, an insertion 
mutant, a frameshift mutant, a nonsense mutant, a missense mutant or splice mutant. 
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23. The nucleic acid of claim 22, wherein said tumor suppressor has the amino acid sequence 
of FIG. 9. 

24. The nucleic acid of claim 22, wherein said tumor suppressor has the amino acid sequence 
5 of FIG. 9. 

25. The nucleic acid of claim 23, wherein said tumor suppressor has the amino acid sequence 
of FIG. 9. 

10 26. The nucleic acid of claim 24, wherein said nucleic acid comprises the coding sequence of 
FIG. 9 or the complement thereof. 

27. The nucleic acid of claim 25, wherein said nucleic acid comprises the coding sequence of 
FIG. 9 or the complement thereof. 

15 

28. The nucleic acid of claim 21, wherein said nucleic acid is selected from the group 
consisting of genomic DNA, complementary DNA and RNA. 

29. The nucleic acid of claim 28, wherein said nucleic acid is a complementary DNA and 
20 further comprises a promoter operably linked to said region, or the complement thereof, 

encoding said tumor suppressor. 

30. The nucleic acid of claim 29, further comprising a polyadenylation signal operably linked 
to said region encoding said tumor suppressor. 

25 

31. The nucleic acid of claim 30, further comprising an origin of replication. 

32. The nucleic acid of claim 31, wherein said nucleic acid is a viral vector selected from the 
group consisting of retrovirus, adenovirus, herpesvirus, vaccinia virus and adeno-associated 

30 virus. 

33. The nucleic acid of claim 32, wherein said nucleic acid is packaged in a virus particle. 
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34. The nucleic acid of claim 3 1 , wherein said nucleic acid is packaged in a lip 



osome. 



35. The nucleic acid of claim 21, wherein the nucleic acid is of a size selected from the group 
consisting of about 1212 bases, about 1500 bases, about 2000 bases/about 3500 bases, about 
5000 bases, about 10,000 bases, about 15,000 bases, about 20,000 bases, about 25,000 bases, 
about 30,000 bases, about 35,000 bases, about 40,000 bases, about 45,000 bases, about 50,000 
bases, about 75,000 bases and about 100,000 bases. 

10 36. An isolated oligonucleotide of between about 10 and about 50 consecutive bases of a 
nucleic acid, or complementary thereto, encoding a tumor suppressor designated as TS10q23.3. 

37. The oligonucleotide of claim 36, wherein the tumor suppressor is murine. 

15 38. The oligonucleotide of claim 36, wherein the tumor suppressor is human. 

39. The oligonucleotide of claim 37, wherein the nucleic acid is the coding region of FIG. 9. 

40. The oligonucleotide of claim 38, wherein the nucleic acid is the coding region of FIG. 9. 

41. The oligonucleotide of claim 36, wherein the oligonucleotide is about 15 bases in length. 

42. The oligonucleotide of claim 36, wherein the oligonucleotide is about 17 bases in length. 
25 43. The oligonucleotide of claim 36, wherein the oligonucleotide is about 20 bases in length. 

44. The oligonucleotide of claim 36, wherein the oligonucleotide is about 25 bases in length. 

45. The oligonucleotide of claim 36, wherein the oligonucleotide is about 50 bases in length. 

46. A method of diagnosing a cancer comprising the steps of: 
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(i) obtaining a sample from a subject; and 

(ii) determining the expression a functional TS10q23.3 tumor suppressor 
in cells of said sample. 

5 47. The method of claim 46, wherein said cancer is selected from the group consisting of 
brain, lung, liver, spleen, kidney, lymph node, small intestine, pancreas, blood cells, colon, 
stomach, breast, endometrium, prostate, testicle, ovary, skin, head and neck, esophagus, bone 
marrow and blood cancer. 

48. The method of claim 47, wherein said cancer is prostate cancer or breast cancer. 
10 49. The method of claim 47, wherein said cancer is a brain cancer. 

50. The method of claim 49, wherein said brain cancer is a glioma. 

51. The method of claim 46 said sample is a tissue or fluid sample. 

52. The method of claim 46, wherein said determining comprises assaying for a nucleic acid 
from said sample. 

15 53. The method of claim 52, further comprising subjecting said sample to conditions suitable 
to amplify said nucleic acid. 

54. The method of claim 46, wherein said determining comprises contacting said sample with 
an antibody that binds immunologically to a TS 1 0q23 .3 . 

55. The method of claim 54, further comprising subjecting proteins of said sample to ELISA. 

20 56. The method of claim 46, further comprising the step of comparing the expression of 
TS10q23.3 with the expression of TS10q23.3 in non-cancer samples. 

57. The method of claim 56, wherein the comparison involves evaluating the level of 
TS10q23.3 expression. 

58. The method of claim 56, wherein the comparison involves evaluating the structure of the 
25 TS10q23.3 gene, protein or transcript. 
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59. The method of claim 58, wherein said evaluating is an assay selected from the group 
consisting of sequencing, wild-type oligonucleotide hybridization, mutant oligonucleotide 
hybridization, SSCP, PCR and RNase protection. 

60. The method of claim 59, wherein a said evaluating is wild-type or mutant oligonucleotide 
5 hybridization and said oligonucleotide is configured in an array on a chip or wafer. 

61 A method for altering the phenotype of a tumor cell comprising the step of contacting the 
cell with a tumor suppressor designated TS10q23.3 under conditions permitting the uptake of 
said tumor suppressor by said tumor cell. 

62. The method of claim 61, wherein said tumor cell is derived from a tissue selected from 
10 the group consisting of brain, lung, liver, spleen, kidney, lymph node, small intestine, blood cells, 

pancreas, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, head and neck, 
esophagus, bone marrow and blood tissue.. 

63. The method of claim 61, wherein the a phenotype is selected from the group consisting 
of proliferation, migration, contact inhibition, soft agar growth and cell cycling. 

15 64. The method of claim 6 1 , wherein said tumor suppressor is encapsulated in a liposome. 

65. A method for altering the phenotype of a tumor cell comprising the step of contacting the 
cell with a nucleic acid (i) encoding a tumor suppressor designated TS10q23.3 and (ii) a 
promoter active in said tumor cell, wherein said promoter is operably linked to the region 
encoding said tumor suppressor, under conditions permitting the uptake of said nucleic acid by 

20 said tumor cell. 

66. The method of claim 64, wherein said tumor cell is derived from a tissue selected from 
the group consisting of brain, lung, liver, spleen, kidney, lymph node, small intestine, blood cells, 
pancreas, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, head and neck, 

25 esophagus, bone marrow and blood tissue. 

67. The method of claim 65, wherein the a phenotype is selected from the group consisting 
of proUferation, migration, contact inhibition, soft agar growth or cell cycling. 

68. The method of claim 65, wherein said nucleic acid is encapsulated in a liposome. 
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69. The method of claim 65, wherein said nucleic acid is a viral vector selected from the 
group consisting of retrovirus, adenovirus, adeno-associated virus, vaccinia virus and 
herpesvirus. 

5 

70. The method of claim 69, wherein said nucleic acid is encapsulated in a viral particle. 

7L A method for treating cancer comprising the step of contacting a tumor cell within a 
subject with a tumor suppressor designated TS10q23.3 under conditions permitting the uptake 
10 of said tumor suppressor by said tumor cell. 

72. The method of claim 71, wherein said tumor cell is derived from a tissue selected from 
the group consisting of brain, lung, liver, spleen, kidney, lymph node, small intestine, blood cells, 
pancreas, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, head and neck, 
esophagus, bone marrow and blood tissue. 

15 73 . The method of claim 7 1 , wherein the subject is a human. 

74. A method for treating cancer comprising the step of contacting a tumor cell within a 
subject with a nucleic acid (i) encoding a tumor suppressor designated TS10q23.3 and (ii) a 
promoter active in said tumor cell, wherein said promoter is operably linked to the region 
encoding said tumor suppressor, under conditions permitting the uptake of said nucleic acid by 

20 said tumor cell. 

75. The method of claim 74, wherein said tumor cell is derived from a tissue selected from 
the group consisting of brain, lung, liver, spleen, kidney, lymph node, small intestine, blood cells, 
pancreas, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, head and neck, 

25 esophagus, bone marrow and blood tissue. 

76. The method of claim 74, wherein the subject is a human. 

77. A transgenic mammal in which both copies of the gene encoding TS10q23.3 are 
interrupted or replaced with another gene. 

78. A method of determining the stage of cancer comprising the steps of: 
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(i) obtaining a sample from a subject; and 

(ii) determining the expression a functional TS 1 0q23 .3 tumor suppressor 
in cells of said sample. 

5 

79. The method of claim 78, wherein the cancer is brain cancer and the stage is distinguished 
between low grade and glioma. 

80. The method of claim 78, wherein the determining comprises assaying for a TSl0q23.3 
10 nucleic acid or polypeptide in said sample. 



81. A method of predicting tumor metastasis comprising the steps of: 

15 (i) obtaining a sample from a subject; and 

(ii) determining the expression a functional TS 1 0q23 .3 tumor suppressor 
in cells of said sample. 

82. The method of claim 81, wherein the cancer is distinguished as metastatic and non- 
20 metastatic. 

83. The method of claim 81, wherein the determining comprises assaying for a TS10q23.3 
nucleic acid or polypeptide in said sample. 

25 84. A method of screening a candidate substance for anti-tumor activity comprising the steps 
of: 



(i) providing a cell lacking functional TS10q23.3 polypeptide; 

(ii) contacting said cell with said candidate substance; and 

(iii) determining the effect of said candidate substance on said cell. 



85. The method of claim 84, wherein said cell is a tumor cell. 
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86. The method of claim 85, wherein said tumor cell has a mutation in the coding region of 
TS10q23.3. 

87. The method of claim 86, wherein said mutation is a deletion mutant, an insertion mutant, 
5 a frameshift mutant, a nonsense mutant, a missense mutant or splice mutant. 

88. The method of claim 84, wherein said determining comprises comparing one or more 
characteristics of the cell in the presence of said candidate substance with characteristics of a cell 
in the absence of said candidate substance. 

10 

89. The method of claim 88, wherein said characteristic is selected from the group consisting 
of TS10q23.3 expression, phosphatase activity, proliferation, metastasis, contact inhibition, soft 
agar growth, cell cycle regulation, tumor formation, tumor progression and tissue invasion. 

15 90. The method of claim 84, wherein said candidate substance is a chemotherapeutic or 
radiotherapeutic agent. 

91. The method of claim 84, wherein said candidate substance is selected from a small 
molecule library. 

20 

92. The method of claim 84, wherein said cell is contacted in vitro. 



93. The method of claim 84, wherein said cell in contacted in vivo. 

25 94. A method of screening a candidate substance for anti-kinase activity comprising the steps 
of: 



(i) providing a having TS10q23.3 polypeptide comprising at least one tyrosine 
kinase site; 

30 (ii) contacting said cell with said candidate substance; and 

(iii) determining the effect of said candidate substance on the phosphorylation of said 
site. 
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95. The method of claim 94, wherein said determining comprises comparing one or more 
characteristics of the cell in the presence of said candidate substance with characteristics of a cell 
in the absence of said candidate substance. 

96. The method of claim 95, wherein said characteristic is selected from the group consisting 
of phosphorylation status of TS10q23.3, TS10q23.3 expression, phosphatase activity, 
proliferation, metastasis, contact inhibition, soft agar growth, cell cycle regulation, tumor 
formation, tumor progression and tissue invasion. 

97. The method of claim 94, wherein said candidate substance is chemotherapeutic or 
radiotherapeutic agent. 

^_ 

98. The method of claim 94, wherein said candidate substance is selected from a small 
15 molecule library. 

99. The method of claim 94, wherein said cell is contacted in vitro 



10 



20 



100. The method of claim 94, wherein said cell in contacted in vivo. 

101. The nucleic acid of claim 22, wherein the mutant is a splice mutant. 

102. The nucleic acid of claim 101, wherein said splice mutant has a mutation in exon C. 

25 103. The nucleic acid of claim 102 wherein said splice mutation results in a change from G to 
T at position + 1 at the splice junction of exon C. 

104. The nucleic acid of claim 101, wherein said splice mutant has a mutation in exon H. 

30 105. The nucleic acid of claim 104 wherein said splice mutation results in a change from G to 
T at position + 1 at the splice junction of exon H. 

106. The nucleic acid of claim 22, wherein the mutant is an insertion mutant. 
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107. The nucleic acid of claim 106, wherein said insertion mutant is in exon G. 

108. The nucleic acid of claim 107 wherein said insertion mutation is a 2 base pair addition of 
5 TT at position 98 of exon G. 

109. The nucleic acid of claim 22, wherein the mutant is a frameshift mutant. 

110. The nucleic acid of claim 109, wherein said frameshift mutation is in exon G. 

10 

111. The nucleic acid of claim 22, wherein the mutant is a missense mutant. 

112. The nucleic acid of claim 111, wherein said missense mutation is in exon B. 

15 113. The nucleic acid of claim 112, wherein said mutation results in a change from T to G at 
position 46 of exon B. 

114. The nucleic acid of claim 113, wherein said mutation results in a change from LEU to 
ARG in the tumor suppressor. 

20 

115. The nucleic acid of claim 112, wherein said mutation results in a change from G to A at 
position 28 of exon B. 

116. The nucleic acid of claim 115, wherein said mutation results in a change from GLY to 
25 GLU in the tumor suppressor. 

118. The nucleic acid of claim 22, wherein said mutant is a nonsense mutant. 

119. The nucleic acid of claim 118, wherein said nonsense mutation is in exon H. 



30 



120. The nucleic acid of claim 118, wherein said mutation results in a change from C to T at 
position 202 of exon H. 
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121 . The nucleic acid of claim 22, wherein said mutation is in a glioma cell. 

122. The nucleic acid of claim 22, wherein said mutation is in a prostate cell. 

123. The nucleic acid of claim 22, wherein said mutation is in a breast cell. 

124. The nucleic acid of 122, wherein said mutation is a deletion in exon A. 

125. The nucleic acid of 124, wherein said mutation is a deletion of AA at positions 16 and 17 
of exon A. 

126. The nucleic acid of 122, wherein said mutation is a change from C to T at position 53 in 
exonB. 



WO 98/33907 



PCT/US98/00353 



1/27 



Localization of Candidate Tumor Suppressor 
Loci on Chromosome 1 0 



Sonodaet aL. 199 6 

Murakami et al. ( 199 6 
Sanchez et aL, 199 5 




Steck et al. t 199 5 
Prostate 

Melanoma 
Saya et at. 

Endometral carcinoma 
Albosoa et al. f 199 6 

Rashed et al. f 199 5 

Saya et al 



Common deletions in 
Prostate Carcinoma 
Gray et al., 199 5 — 
Trybus et al., 199 6 
Ittmann 1996 ^ 



O10S190 
D10S216 
O10S 169 



D10S206 
/ D10S219 
D10S532 

D10S541 
D10S215 

D10S185 
D10S198 



\ 



D10S192 



\ 



3 CM 

4 cM 

3 cM 

7 cM 

5 cM 

2 cM 
LG-11 breakpoint 



FIG. 1 



BNSDOCID: <WO 9833907 A 1 I > 



WO 98/33907 PCT/US98/00353 

2/27 



D10S249 

D10S223 

D10S191 

D10S189 

D10S196 

O10S210 

D10S197 

D10S532 

D10S551 

D10S573 

D10S1687 

D10S579 

D10S215 

AFM086 

D10S541 

AFM280 

W1027S 

D10S564 

D10S536 

D10S583 

D10S185 

D10S200 

D10S091 

D10S571 

D10S110 

D10S574 

D10S577 

D10S198 

O10S603 

D10S192 

D10S467 

D10S540 

D10S530 

D10S521 

D10S597 

D10S562 

D10S221 

D10S209 

D10S587 

D10S216 

D10S169 

D10S590 

O10S212 





o o o op o o o o o o 

OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOOi 
OOOOOOOOOOO 
OOOOOOOOOOO 

o o • op o o o o o o 

o o o op o o o o o o < 

OOOOOOOOOOO 

o o o op o o o oo o 

OOOOOOOOOOO" 
OOOOOOOOOOO' 
OOOOOOOOOOOi 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO 
o o o op O O O O O O ' 
o o o op O O O O O O ' 
OOOOOOOOOOO' 
o o o op O O O O O O ' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
OOOOOOOOOOO' 
o o o op O O O O O O ' 
OOOOlOOOOOOO' 



a u a 

"*i 0> CO _» U\ 

-* <•> a » - 




" ^ "i g f 9 9 

to «t o 



OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 

ooooooo««o 

OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 

OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 
OOOOOOOOOO 

> c > r c 



lOp 



10q23.3 
-q24 



o 

ro 



O Homozygous loss 
o Allele present 



CO 

3E 

> 
-i 

33 

o 



13 

o 



T3 

r- 



c 

CO 
CO 



2 S 



10q25 



FIG. 2 



WO 98/33907 



PCT/US98/00353 



3/27 



N10.7 Subclones 



D10S249 

D10S223 

D10S89 
D10S196 

D10S210 
D10S195 

D10S219 

D10S215 ^ 
AFM086 [ 
D10S536 | 
D10S563 

D10S192 

D10S205 

D10S534 
D1.0S530 
D10S566 

D10S108 
D10S521 
D10S597 
D10S562 
D10S554 

D10S187 

D10S1158 
D10S610 

D10S209 
D10S587 

D10S216 

D10S169 
D10S590 

D10S212 
O10S555 
D10S1134 




< o 

in in 
o o 



i 



1 Op 

£> 

Q. 



in 
5* 

3 
CO 



H 

V) 



10q23.3 



0< 



HZ 



m 



m : 



"Fully" suppressed hybrids 



"Partially" 
Suppressed 



Hybrids 



FIG. 3 



BNSDOCID: <WO 9833907 A 1 I > 



WO 98/33907 



PCT/US98/00353 



4/27 



Tumors 

2 3 



D10S249 

D10S201 

D10SS32 

D10S551 

D10S579 

D10S21S 

AFM086 

D10S541 

AFM280 

D10S564 

D10S583 

O10S185 

D10S571 

D10S574 

D10S198 

D10S192 

O10S540 

D10S521 

D10S597 

D10S187 

D10S587 

D10S216 

D10S575 

D10S169 



o< 

<2M 



4 5 9 10 11 12 13 14 15 16 17 19 20 21 22 23 24 25 26 27 28 



o 
o 



o 



oooo 
m 



o 



o 
m 



oo 

©© 

o© 



o 
o 



©oo 



o 
o 



ooo#oo 
olo##© 
o©## 



©o 



o 



o 
o 
o 
o 



»ooo< 
>#@©i 



><D> 
KD 



• titlrtoitootoioit© 

•©ooo«©g§oog©e©o©S 



o 



• 



OO 

©<§> 
o© 



oe 
©© 



i0§90« 

A A I I I I A'A I A'A 



O©© 

o©o 

OOO 
OS© 

o©o 



»o 



©©•© OO(0) 

• eoo o©@ 

§OOOG ©<© 

• §0OO«(D 

o©#© 



c 
o 

'5> 
tr 



(0 

o 



J 



Region NOT 
Involved 



O 

Heterozgous 
maintained 



LOH 



Non- 
Inf ormat ive 



GBMs 

Instability No Data 



FIG. 4 



BNSDOCID: <WO 9833907A1 I > 



WO 98/33907 



PCT/US98/00353 



5/27 



Mr (kb) 



12 
1 1 - 

6.8 - 
6.7 " 

5.f- 6: 
5.2 - 
4.1 " 

3.2 • 

3.0 " 

2.8 

2.7 : 

2.2 ' 



0.8 



Eco R1 Bands 

15 
14 

\l (V) 

11 

10 

9 
8 

7 

6 

4 5 
23 
1 

1a (V) 



106d16 




EFC-2 LG-11 A172 D54 




Eag! 

BssHII 

not 
sacll 



Bam Ht 



(V) -vector 

<V) xhol not 
13 I 1 1 — 



14 



1, 4, 5 [7 2 3 

t — r 



46b12 



Bam HI 



10 1 



Bam HI 



1 96 



15 



Not /Eco Band (#14) 



xhol 



Bam HI 



121 



not 



106d16 



Critical Region 



LG-11 

D54 

A172 

Normal 

lymphocytes ' ■ — — — — — — — — — 

present Y////A Homozygous delete 



FIG. 5 



WO 98/33907 



PCT7US98/00353 



6/27 



CCTCCCCTCGCCCC 
CGGGCAGGCCG 



CCTCCCCTCCCKCGCCGCC^^ ,00 

ea^woGo^wsAooaMac^^^ soo 

™««»«^^ 1200 

™»«*«^^ 1400 

™«««^^ 1600 

*»*™»^^ 1700 

"»««»™»^^ ^ 

^*w«"™^ 2100 

"00 

««»«««™«^^ 2JO0 



cca^Ia^atI^^ 3000 



FIG. 6 



BNSDOCID: <WO 9833907A1J . > 



WO 98/33907 



7/27 



PCT/US98/00353 



1MTAIIKEI V.S RNKRRYQEDGFDLDLTYIyp 
31 N I IA MCPPAERLECV yRNNIDD VVRPLDSK 
61HKNHYKIYNLCAERHYDTAKFNCRVAQYPF 
91 EDHNPPQLELIKPFCEDLDQWLSEDDNHVA 
121AIHCKAGKGRTGVM1CAYLLHRGKFLKAQE 
151 ALDFYG.EVRTRDKKGVTIPSQRRYVYYY^Y 
1B1-LLKNHLDYRPVALLFHKMMFETIPMFSGGT 
211 CNPQFVVCQLKVKIYS5NSGPT R R X X> X F M Y 
241 FEF PQPLFVCGDt'XKVEFFHKQN K M L K K D K h ' 
271 FKFWVNTFFIFGPEETSEKVENGSLCDQE1 
201 D S I C S I E RADNDKEY LVLTLTKNDLDKANX 
231 DKANRY FSPNFKVKLYFTKTVEEPSNPEAS 
361 SSTSVTPDVSDNEPDKYRY-5DTTDSDFENE 
351 PFDEDOHTQITKV 



FIG. 7 



WO 98/33907 



8/27 



PCT/US98/00353 



46b12«Sp6 



AFM 

loscne oee ^oe 2F20 

^ p- j . 106el16.S6 ©Sp6 

B A 



D10S541 




f «glon 
Homozygouo cielotlo 



0«l«tlon Boundaries 



FIG. 8 



WO 98/33907 PCT/US98/00353 



O A t- 




(J (1 ti 




ii r> 




t? 


i o 


(J 








(_i 


i a: 






o 






i IX 






u 






• Vi 






o 




o 


• c 






(J 




(J 




{J 








o 


t a. 












i IX 


o 




o 




u 


1 t- 


E-* 








u 




1- 




u 




e» 


t 










-> 




u 




(J 




u 


i D. 


(- 




v-> 




o 


t £- 


o 










t a. 


u 




«-■ 




u 




u 




o 




u 


i a: 


t- 




o 






! < 


u 




f- 




u 




t- 
















u 





o 

u 

o 

o 
e> 
o 
u 
u 
o 

o 
< 
u 
o 

u 
o 
a 
o 

o 
o 
< 
o 
o 
u 



U A 

e> ii 
ti 

o 
< 
o 
u 
o 



C3 

< 

o 
o 
o 

CD 



o 
o 

L) 
E- 
U 

U 
V 

o 

H 

u 

< 
o 

u 
o 
u 
o 



ix a: 



fc- c 




J o 


A 1 


o e> 


II * 




11 


e> c? 




a o 




o o 




u o 




o o 




o o 




o o 




u u 




o u 




< < 




o o 




o o 




o o 








3 8 












u o 




u u 




o o 




CJ (J 








< < 




u u 




C3 O 








u e> 




o o 




u o 




u cj 




o o 




o u 




u u 




o o 




< < 




O V 








< < 













< 



o o 

I- t— 

u u 

u u 

o o 

o u 

O C3 

< < 

E- E- 

O tJ 
E~ 



O O 
U U 

u o 
u o 

<: < 

< < 

< < 



W CO 

to to 

a. q. 

< < 

< < 



a: en 
o o 

co to 
a. 

w ti 

cc O 

< < 

< < 

< < 

< < 

U3 U 



a 
a 

m to a cx m 
e n oi cu qj a • 
3 o o aaaaen 

• • • o c a> a u o' 

H H H O ID (/) • • O 

■ • • • £ D D) O.H 
vov£>vD*JD O OQJtr. 
UOU ^ ^ ETJ at- 



C <1J 

(0 0) 

E D 

d o 



a 

(D Q) a. • 
cx a a a e n 



o c tt) 

O to in 

• E 3 

■ ■ 3 O 



<D O <N 

a u cr 
• • o 

Cn CX^-n 
o a) in 



r- co vd cr^ oo. m 



a 



a ■ 

E m 
o fs 
u cr 
■ o 

Q> CO 

a t- 



VDm( — U) U) Ox CO i/i 



o o 

E 'O 



vo vo vo 
UUtJ 



a a 
a a a a 

S • • Q) 

o c o) a, 

VJ ro to * 

• e 3 at 

^■300 
- ~ E T5 



(6 



C Q) 

TO 10 

E D 

3 O 

-C E 



aaaaEn 



vo vo 
CJ O 



E 

O C 0) 

L) 03 CO 

• E 3 

3 O 



0)0 04 

a, u a 



D> CLrH 

O Q) CO 



a ex: e 'o ah 

vomr-^o^rvocrvoo ml 



Rw<;nnr.in 5 yyo QRrwoo7Ai i > 



WO 98/33907 



PCT/US98/00353 



10/27 









VT 


r* H a 


t 






U O ii 


i 






ti 








cd cd 


i 


CD 


o 










< *t 








cd cd 


! 


CO 


o 


L9 *H 








CD CD 








<L <. 




CC 


PC 


*t CD 


* 






< < 








CD CD 


i 


It] 


u 


H O 


* 






CJ CJ 








CJ t 




CU 




E- i 


* 






CJ i 








in £ 




CO 




H H 








CJ U 








CD < 






f** 


CD O 


■ 














f- cj 


♦ 


CO 


o 


CJ cd 


* 
















* 


to 


w 










cj o 








B H 


* 


CO 


CO 


CJ CJ 










• 


< 


< 


< < 








CJ o 


• 










< 


< 


H CD 


* 






CD CD 


■ 






CJ cj 


• 


cc 


cc 


CJ CJ 


j 






< < 

a a 




2: 












CJ CJ 


• 


CO 


CO 


H CD 
H E- 


* 


Uj 


(J 


CJ CJ 








H f- 








CD CD 




> 


> 


CJ CJ 
















H CD 




u. 


> 


















U CJ 




►J 


-J 






















*J 












cd cd 








cj o 




a: 


cc 










Bb 








E-* CJ 




to 


cu 


|- CJ 


* 






H CD 


* 






cd cd 




> 


CD 


u u 








cd o 








CD CD 




CD 


CD 


CJ CJ 








o cd 








CD CD 




CD 


CD 


CJ CJ 






o cd 








< < 




CO 


CO 


CJ CJ 








cd cd 








cd cd 




CD 


CD 


o o 








o cd 








CD CD 




CD 


CD 


U CJ 








8 8 












CD 


CD 


CJ u 








U CD 









CD CD 


A 


1 






u o 












[! 








CD CD 




( 


^ 




a a 










CD CD 






U3 


U 


«C < 










CD CD 










CD CD 






CD 








■ 


CD CD 




J 






CD CD 






CD 


CD 


CD CD 










CD CD 










CJ CJ 






CC 


oc 


CD CD 










< < 










CD CD 






Li] 


w 


CD CD 










h h 




i 






CJ u 






.J 




CJ CJ 




j 






E-» H 




, 






o u 




i 


J 


J 


CD CD 










CJ o 




, 






CJ o 
CJ CJ 




i 


CU 


CU 


H £-> 
CJ U 




i 
i 


»J 




CJ u 










< < 




J 






CJ CJ 






31 


3Z 


CD CD 




, 






CD E-« 










U U 




i 


Q5 


»J 


CJ CD 










CD CD 




i 






CJ o 








ck 


CD CD 




i 






CD CD 










CJ CJ 




J 


cc 


cc 


CD » 










CD i 










U i 




* 


PC 


o- 


CD i 










CD i 




* 






O . 




* 


a: 


o- 


CD i 










O i 




* 






CJ i 




* 


d: 




CD fr* 




* 




CJ CJ 










CD O 






< 


< 


CD CD 










< < 










CD U 










E- CD 










O CJ 










CD CD 






< 


< 


CD CD 










O CJ 










CJ CJ 






a. 


CL 












S 3 










88 






u 


W 






















8 8 






K 


cr 


8 S 




















CD CD 






CD 


CO 






















CJ CJ 








«-3 


CD CD 










CD CD 
















CD 


CD 










88 
















tx 


CC 


CJ CJ 










CD CD 










< < 






CO 


CO 


< < 








CD CD 










: s 




* 


o. 


cc 






* 

















O CJ 

CD CD 

£-> e- 

CD CD 

< < 
CD CD 

< < 
CD CD 
CD CD 
CJ CJ 
CJ CJ 
CD CD 

CJ CJ 

3 S 

CD CD 

CD CD 

CD CD 

H *-« 

CJ CJ 

CD CD 

CD CD 

CD CD 

< < 



< < 
CJ CJ 

CD CD 

H 6^ 

CJ CJ 

CJ CJ 

CD CD 

CJ CJ 
E- 

CD CD 

< < 
CD CD 
H CD 
CJ CJ 

O CD 

CD CD 

CD CD 

CD CD 

CD CD 

< < 
CD CD 
CD CD 
CD CD 

< CD 
CJ CJ 
CJ CJ 

U t 

CD i 

< i 
CJ i 

P 1 
fr> i 
O t 
CD r 
CD i 

O CJ 

CD CD 

CD CD 

CJ CJ 

CD CD 

CJ CJ 

CJ CJ 
CD i 
CD i 
CJ i 

CD CD 

CD CD 

CJ CJ 

CD CD 

CD CD 

CJ CJ 

CD CD 

CD CD 

CJ CJ 



CO Ul 
CD CD 

< < 

CD CD 

< < 

DC DC 
CO < 
M M 

> > 
a. cu 

CO CO 

u u 

CO CO 

CD o 

tr cc ■ 
CD CD 

CU Cu 

< o. 

< 

< o- 

< o. 

< < 

< < 

< < 



CD CD 

CJ CJ 

O CJ 

CD CD 

CJ CJ 

CJ U 

O CD 

CJ CJ 

CJ CJ 

O CD 

CJ CJ 

CJ u 



t- CD 

t- t- 

O CD 

CD CD 

CJ CJ 

CJ CJ 

CD CD 

CD CD 

CD CD 

O CJ 

O CJ 

CJ CJ 

< < 
CD CD 
CD CD 
CD CD 

< < 
CJ CJ 
CJ CJ 
E- fc- 

< CD 

CJ CJ 

< < 

CJ CJ 

CD CD 

CD CD 

CJ CJ 

CD CD 

CD CD 

CJ c; 

CD CD 

CD CD 

CJ CJ 

CD CD 

CD CD 

CJ CJ 

cd c: 

CD CD 

CJ c> 
e> o 

CD CD 



CD CD 
E~* E— 



CJ CJ 

CJ CJ 

H i- 

U CJ 

€-* E- 

O CJ 

H E- 



•a. < " 

< < 

CO CO 

a, cu 

21 CO 
> > 

a. a. 

CD CD 

CU CU 

CD CD 
O p 

»-H > 

a: x 

cc c; 
cc cc 

cc cc 

CC Cl 
CC CC 

cc cc 
t- E- 

cl a 

CU CU 

cu a 

CO CO 



c 
o 
o 

cn 
CD 

LL. 



3 o 



a 

a a m 
cu cu a • 
a a a a e r-i 



O C 0) 

U ffl to 

• B 3 

±J 3 O 

C £ E 



0) O rvj 

a u cr 

. . o 

O CU CO 

-o an 



a 

CU 

a 

CU QJ Di • 

& cxaa En 

* 0) O CM 

<d au tr 

CO * * o 

o o cu co 
E TD ah 

vd n i — °o^vx>a>GOun 



c cu 

CO CO 

3 o o _ 

• • • O C 

f-H i—l «— 4 U (D 

• * • * E 

VD VO U) U ^ 

UUU C£ 



C Q> 

(0 CO 

3 O 

-C E 



a 

0) 

a 

cu a> a. * 
Q. a a a E n 



O CJ 



O C QJ 

O aj cn 

• E 3 

3 O 



CU O rsi 
a U CT 
* • o 

CO Oi«-i 
O CU CO 



vomr-tn^rvDcrvoou^ 



C CU 

m co Q> Cu 

E 3 CO 0) 0) 

3 o- o aaaa 

x: E T) E • • QJ 

. . • O C 0) Q. 

HHH U Ifl K • 

• • • • E 3 D> 

\o \x> M> aj 3 O O 

CJ O O C X E 



Cu 

O rvj 

U " 

d 
an 



rv> ^ 



WO 98/33907 



PCT/US98/00353 



11/27 



u 

id cd 
■ o 
u 
6- 

cd cd 

cd cd 

<c < 

cd o 

cd u 

u cd 

lr- U 

cd cd 
u u 

u u 

cd cd 

< < 

u u 

cd cd 

cd o 

<c «£ 

cd cd 

cd cd 

u < 

cd cd 
cd cd 
u u 
(J 
cd 
u 
o 



cd u 
cd cd 
u u 



cd o 
o u 

o o 

cj u 

cd cd 

o o 

o u 

u u 

u u 

(J u 

u u 
o u 

u u 
cd cd 

(J u 
u cj 



— 

A 

II 

II 

< CO 

< < 

< < 

CO 00 
X X 

& o* 

CO CO 
CO CO 

CO a* 

p" >■ 
o o 

to (J 
<x a 

CO CO 

cd cd 
o cd 

cd cd 

< < 

< < 

us to 

a. ol 

< < 
pc oc 

< < 
> J 
a. a. 

a. CL 

< < 



u u 
u u / 

I 

cd o 

(J cd 

cj u 

cd cd 

e h 

u o 

cd cd 

cd cd 

u u 

CJ u 

u u 

< < 

< < 
u u 
u u 
o o 

< < 

U U 

(5 cd 

< < 
o o 

CD CD 

u u 

u o 

cd cd 

u u 

u u 

cd cd 

< < 

(J (J 

cd cd 

< < 
o u 
u u 

< < 
u o 
o u 



5 S 

u u 
cd cd 

I- H 

u u 

8 t; 

CD O 
< 

o u 

u u 

CJ o 

cd cd 

15 CD 

< < 
u o 
o u 

< *t 

< < 

< < 
o o 
o o 

< < 
o o 

u u 
o o 

U (J 

o o 

CD O 

o u 

t- E~« 

o u 
o u 

< < 
o u 
u o 
o o 

U CJ 



ttc ; — 

A I A 

II I II 
II II 

> > ' 

I 
I 

< < > 

I 

I 

< < I 

I 
I 

a* a. i 

I 

>■ >• I 

I 
I 

CO V) 1 

I 
I 

CO CO i 

I 
I 

CC PC I 
I 

a: a: i 

1 

to, CO i 

I 

CO CO i 

t 

I 

CO Vj i 

a< a. i 

a a 

I 

w3 i 

to CO i 
I 

o o • 

«C < i 

O O ' 
I 

^ i 

» 
I 

I 
I 

< < t 

I 
I 

< < I 

I 

—3 _3 i 
I 
I 

cl a * 
I 

I 

a: on i 

I 

< < t 



uuu 
o o o 
< < u 



u 
o 
u 
u 

o 
u 
u 



u o 

< < 
o o 

< < 

CJ o 

u a 

O O CD 
CJ CJ CJ 

CJ C- (J 

CJ u u 

U U CJ 
O 13 O 

5 5 S 

o o e> 

3 S < 



< < o 
CJ u u 
u o o 

uuu 

h h 15 

< < < 

U U ID 



O O Cu * 



f- 1 H 

H U 

H U 

< O 
U U 
O CJ 

o o 

< o 
u o 

< u> 

U CJ 

u u 
t- u 
o o 

5 § 

o o 

O CD 
U U 



CD CD 

CD CD 

CD CD 

CD CD 

< < 
CD CD 
CJ U 
CD CD 

< < 
CD CD 

< < 

u u 

CD CD 

CD < 

CJ CJ 

CD CD 

a cd 
u u 

CD CD 

< < 

u u 
u u 

CD CD 

< CD 
CD CD 

< < 



I x. 06 * 
I I 
I I 

i se: oc * 
i i 

» i 

I 0» O £E * 
i i 

l J J J.i 

* i 

I H H (/) ♦ 

I I 

♦ I 

i to to u * 
« I 

♦ H H < « 

I I 

I < < < t 

* t 
l I 

• dc a: CD * 

♦ to to a. * 

■ i 

\ zc a: * 
i i 
t t 
i < < < i 
i 

*- e- < • 

< < CL . 

to CO ft* * 

< < 

CD CD 

ta u 

< < 

< < 
a a 



CD CD 

< 
< 
< 
< 

< < 

CD CD 

< < 

o u 

CD CD 

< < 

t- f- 

O CD 
O U 

< < 

CD CD 

< •(£ 
CD CD 

< CD 



<- < < 

- - CJ u 
_ CJ u 
CD CD CD 

- < < 

u u 

<. < 

CD CD CD 

H (- 6- 

< < < 



CD CD 
O 

u 

U CJ 
O CD 



CD CD CD 

' < o 

CJ u 

< < 

o u 

u o 

CD CD 



u. a it 

OC DC 

ae: ic; 
2 2 2: 
a a k 

CO CO to 

> > > 

H 1-4 H 

U to W 



U u 
U U 

u u 



« EC CC CC 



CO to to 

Ik Ik (/) 
u. u. o 

II b. b. 
CO to CO 

co co in 
j J j 
to to to 



(L (k & 



o 



C Q> 
(0 CO 

o o 



vo vo 
UUU 



a a 

o. a ex cx 

E • • G) 

o c a; a 

U (0 to • 

• E 3 Di 

w D O O 



a 
a ■ 

B cn 
O 

u cr 
• o 

Qi t-H 

a> co 
an 



vo n co^vocncOLn 



C 0) 

to cn 

§3 Cn 
o o a 

JC e T> E 

. . . o 

HHH O 
VOUJVD U 



c a> a 

«J to * 

3 o o 

uuu c: x: e t> 



a 
a 

m 

a • 

E m 

O CM 

u cr 
• o 

QJ CO 



a a 

0) Q) 



c cv 

<o CO 
a o o 

J= E TD _ 

~ C Q) 

(0 in 

E o 

I VO VD VO D O 



a 
a 



o, a a a e n 

(DON 



o 



a o cr 

• • o 

CTl O. t— l 

o <u to 



^ ^T* tt <y ^ 



UUU £ -C E XJ ttt- 

fO | — CO \D <T\ CD \J~\ 



m co a a 

EDO) 0) 0) 

Z3 O O Q. O* CU Oj 

£ e n h * • a> 

• • • o c id a 

HHH U (D 01 * 

• • • • e 3 ai 

VC <X) VO u D O O 

uuu c x: ETJ 



m 

a • 

E cn 
O cm 

o a 
• o 
a 
<u co 



vomr-co^crvotTxcom 

*~ T~ ~? *T 



WO 98/33907 



PCT/US98/00353 



12/27 



o o 




> 










LJ • 










o o 


O 










o u 




• o 


o 


o » 




< < 


< 








< < 


< 














i LJ 


u 


UJ i 


LO 




H 
f- 












U 


t _J 


•J 


m3 i 




< < 


< 










o o 












<. < 


< 


i o: 


cu 


IX | 


CL 


< < 












< < 


< 














' Ul 


U) 


LJ i 


(ij 


< < 


< 










u u 


u 






1 




o o 


o 


i < 






< 


H E- 


H 










CJ u 


U 










O O 


O 


i Cu 


CL 


CL i 






t- 
























e- 


. u. 




U. i 




< < 


o 










o o 


o 










c? o 


o 


1 u 


o 


O i 


e 




o 












e- 










< < 


-c 






E < 






E— 










O O 


U 












O 


1 < 


< 


< • 


< 


£ 8 


t- 










E- 










< < 


< 


1 l-H 










E- 






















< < 


< 










O E- 










2 5 5 










< < 


u 


* 










o 










u o 


o 


i a. 






CL 














< < 


< 












i— 


i >* 




>• 






e- 






















< < 


< 


i i 










E- 










< < 


<. 












t— 


i >* 


>- 


>* 


>* 


U U 












U U 












< < 


< 


i E- 


(- 


E— 


E- 




























E— 


! j 










U 
























O 


i o 


a 


a 


i ^ 


< < 


o 










E- H 


E- 










6- H 




• j 








U U 












< «C 


*c 










e> o 


o 


1 a 


Q 


Q 


a 


u u 


u 






















£- €- 




1 u. 


U. 


U. 


1 u. 


< < 


O 


* 










O 












O 


i (j 


O 


o 


i u 


H E- 


E- 






< < 


< 










o o 


U 


! a 


O 


o 


i a 


o o 


O 










< <t 


< 










o o 




1 LO 


U 


LO 


I LJ 


< < 




• 










< 










o o 


o 


J o 


o 


o 


! o 














< < 


< 
















>■ 


> 




< < 


U 


* 





















<-< < < < 



EE 



f:f:t: 



U U U 
U. U. tu 

a. cl cu 

i£ i£ M 
•—i i •— « 

J J J 

LO LU CU 
J J J 

a a a 

CL CL CL 

cl cl a. 

2 S 2 
XXX 
Q Q Q 
W W W 

Ll Ll Lu 
Cu CL CL 
5- 

o o o 

< < < 
> > > 

a: cc a; 

(J O U 

2 2 2 

Lu Lu U. 

x ^ 

< < < 

E- Er» H 

Q Q O 
>->«>* 



< 

o 



u u o 

< < < 

u o u 
o o o 

H E-« f- 

E- E- 1 E-« 

O 15 U 

E- fr> H 

< < < 

u u o 
h h e- 

< < 

< < < 
o o o 
<; < < 
o o o 

f- E- f- 

< < < 

u o o 

< < < 

< < < 

e> o o 
fi 

o o o 

< *c < 

< < < 

E— E— E- 

O U U 

o e> o 
o o o 

t- H 

< < < 

< <: < 
o u o 
a u o 

< < < 
o o o 

E- E- H 

f- f- 

u a u 
e- h e- 

< < < 
o o o 

< < < 

< < < 
o u o 

H H 

u u u 



£ S E 

> > > 

o o o 

K DC OS 

O U CJ 

« ^: ^ 

O O o 

< *c < 

^ ^ 

o o o 



< <: < 
> > > 



a o q 

CCD 
LJ Ll LJ 

vi c/; cn 
*J J J 
3 3 3 

o o o 
a o a 
j ^ j 

a o a 

LJ LO LO 



C 

o 
a 



f0 CO 

3 o 



a 
a 

tu o> a • 

E • * Q) o cm 
O C OJ CL U D' 
O (0 « • • o 

•E3 & a*-i 

*J 3 O O <U Ul 



M> — CO t)t VO CO IT. 



C GJ 

<0 to o* Oj 

£ P Dl 0)0) 

3 o o cx CL Q* a 

& • -0) 

• • O C 0) 
<-h t-i O nj o) • 

• * • B 3 D) 

VD VO U> -LJ o o O 
UUU C JC E TJ 



a 
a 



Qu • 
B r-t 
O r>a 
u cr 
• o 

an 



^ ^ ^ rn — j 



<G CO 

E 3 

3 O 

JC E 



a 

QJ 

a 

CL CL r~i 
Cn 0J Qj CL ■ 
O Qu CL CL CL E 
T3 E ■ • A) 0(\ 

• (J C 0) CL U C 
h O nj io . o 

* • E 3 Di CLt^ 
lj 3 O O QJ CO 

U C£ af- 



ro cn CL CL 

E 3 D» <D Q> 

3 O O CL CL CL CL 

J= ET3 E • • QJ 

* • • O C QJ a 

H HH O (D (fi 

• • ■ E 3 Cn 

VOVOVO U 3 O O 

U U U C £ £13 



vDnr-ooTTvocncoiri 

r»> — « ■■-■» tth , — l 



a 
or 

QJ 



a 

E m 
o 

u cr 
• o 
a 

0) CO 

at-* 



WO 98/33907 



PCT/US98/00353 



13/27 



c 
c 
* 

o 

o*— * 

■ 

O 

co — 

O 

r-— 

O 

vO— 

O 

in— 
o 

*T 

o 
«y 

o 

r*— 

< 

2° 

r- 
cr. 

<N 


J 0 'J 1 0 0 cj • 0 

t < < 1 

a; < 1 x 1 co — 

5 5 5: : 

a: «t < ' s«s ic 1 1^ 

J U 0 i • ; 

j O O 1 O Q O 1 C ( 

< < " ' < 
*> O O 1 1 

< < < l OS OS OS t OS 

O 

L) U U ' 1 »™ 
L) U U 1 ' ^ 

< < < t .H H H < H 
13 O O ' ' 

O O O ' ' 

< < < 1 OS OS DC 1 CC 
2 < < ' 1 

^ E- H ' ' 
OOO ' > > > 1 > 

< < < . < o 

< < < 1 1 

(J O U 1 UJ L0 U3 1 UJ m 

0 0 0 1 1 

cn O O 1 1 

000 1 to 0 O t c 

H H H 1 1 

p p p , > >. X 1 >- 

OHO* ' 

*- ' ' 0 

t- f_ f- 1 U* b~ U. 1 lu U"t— 

< < S 1 ! ^ 
OOO 1 O Q Q ' Cj 

< < < 1 » 

h h h » , , ■ . 

0 0 U 1 ~ -> -J ' — 

0001 • 

0 0 U ' ' 
OOO 1 < < < ' < 

0 

0 0 0 « 

t in 
000 1 u u u 1 l; 

5 5 5 i : 

O 0 0 1 O O O " 0 

< < < 1 1 
0 U O 1 ' 

0 0 0 » < < < 1 < 

0 0 0 • • 

0 

< < < 1 ^ — 

< < < 1 >r. 1 x m 

< < < * 

h h h • ^ . , 1 . 

E-f-O* -3 -3 » — 
HHH' 1 
H E- E- < 

£-i (— c 1 u. U. It- 1 u. 

< S: < 1 

0 

< < < 1 *S b: > r>»— 

0001 f» 
O O O • 

U U U 1 OOO ' c 

0 0 0 1 ' 
0 u 0 1 1 

u O O 1 OS OS OS l CS 

H H E- ' ' 

rtl *C 1 • 

O U u 1 xxx 1 x ^ 

< O < * ' 
HHH 1 ' 1/1 

e- H 1 J -J J ' -3 

< < < 1 ' 

^ H H J J J 1 J 

E- H E- 1 ' 

< <. < 1 • 

E— €— f-» 1 >■>•>* ' > 

< < < 1 « 

0 

- O O U 1 0- 
0 U U 1 < < < 1 < 10 

e- e- h 1 • 
0 0 0 ' 

8- E- H • O U O 1 O 

< H H * 1 
H f- H ' ' 


(~> f 1 U* U. U. 1 ix, f 

3 O O ■ • f 
-» E- H 1 • f- 

-Of-* JJJ t bJ r-— < 
3 O O 1 t vo t 

- E- F- i 1 * 
J O O 1 J *J -3 > ~~< 

< *C < t 1 
J O O « ' 

j C3 O t < < < 1 < 

J O (3 1 < *. 

H E- • ' * 
^> O O < > > > • > * 
0 

<(_><+ 1 \£>— 
U O O 1 1 

Cj 0 0 1 a. cl a. 1 a. 

< < < 1 • 

C3 O O « ' ' 
<<< 1 OS OS OS 1 OS I 
HUH* 1 < 

< < < 1 t 

H H H 1 ><>->- 1 > 
H H H 1 1 

O 

< < < 1 i in— 

OOO 1 O Q O 1 O VD 
C3 O O t « 
H H H • * 
O U U 1 J J J 1 
HUH* 1 

< < < 1 1 

U U O 1 XXX 1 X 
H H H ■ ' 

< < < 1 1 

O 

< < < 1 2 2 2 1 2 ^r— 
C3 < t3 * » 1 ud 

5 5 S 1 s*; 1 x 

< < < i 1 

H H H 1 • 
HUH* J^J 1 - 
t3 C3 t3 ' ' 
H H H 1 1 
U U U i 1-3 *-3 t-J 1 J 

O 

O O O 1 » '->— 

< < < l l Vf) 

H H H « >«>•*>* 1 >- 
CJ O CJ ' 1 

0 e> 0 1 • 

< < < 1 www 1 cn 
H H H " 1 

< < < 1 1 

H H H ' ><>•>■ 1 >• 
H H H » • 

0 

< < < 1 1 r»j— 
HHH «*>•>•>- ">■ ^> 
H H H < ' 

< < < 1 

H H H 1 1 >• 
O < O * 
H H H ' 

OOO < > > > " > 
H H H ' » 

< < < 1 

O 

H H H • X X > ' >' 

O U CJ • ' VD 

0 0 0 » • 

U U CJ • OS OS OS 1 Cl 

u u U 1 » 

0 0 0 t 1 

< < < I OS PS OS 1 OS 
CD C3 C3 • 1 

*C *< *C 1 1 

U U U 1 000 • 0 

0 

■ H H H ' 1 O— 
CD O CD » ' 

< < < 1 WWW 1 CO 
U O O * 1 

U U O ' « 

000 * a. a, 1 a 

est: : 

<; << <C 1 w »h w 1 
H 5 H * 1 

0 

- U U L> 1 1 o%- 
<<<iHHH'H m 

< O < ♦ ' 
H H H t t 

CD CD CD 1 > > > 1 > 

< < < 1 ( 
CD CD CD ' » 


H J— . ' >^ ^ 1 C. 

C < U * ' < 
-« H H » ' < 
0 

D CD CD • • r* C 
C < < 1 ' E- 
d *5 < 1 i X ^ 1 v. < 
3 CD CD 1 ' E- 
-. E- H 1 " < 
DCDCDi>>>«> C 
D CD CD i 1 fc 

< *t < » • C 

< < < 1 ^S 5tf X 1 ^ C 

O 

< < < » 1 m — E 

- H H ■ 1 r- C 
j 0 O » J J -3 t — I 
J CD O 1 1 t 

< < < • 1 1 

j 0 c; 1 O C* O 1 O < 

J U U ' 1 ( 

CD CD CD 1 1 

- H H ' U O O 1 O 
CJ O O 1 t 

O 

H H H 1 1 "3> — 
CDCDOi>>>i> r- 
CD O CD ' 1 
'r> H H > » 
CD CD O 1 > > > 1 > 
■* H H 1 1 

? H* H '■ b. fau U. i U. 
CD CD CD ' 1 

< < < 1 i 

O 

U U U I OOO ' O rr- 

H H H 1 t r- 
U O O 1 1 
U u 0 t a. a. a. 1 a. 

H H H • 1. 

*c <: < « i 

< < < i2 2 2 1 — 
CJ CJ CJ 1 1 

O CD CD • » 
H H H ' CJ CJ CJ 1 

O 

H H H • • rvi — 
O U U ' t r- 

3 5 5 : H H : E " 

CD CD CD • • 
CDCDCD 1 CD CD CD •! C 
O U O • » 
CD CD CD * • 
CD CD CD « CD CD CD ' C 
H H H » 1 

O 

CD CD CD ' 1 >-»— 

< < < r w W W 1 t r- 
O O U 1 • 

H H H * ' 
H H H • U. U. U. 1 u- 
O CD CD ' ' 
H H H ' t 

< *C -C . 2 2 *Z 1 "Z 

< < < • • 
O O O • 

O 

000 < a. a. 1 0 — 
H H H 1 
H H H • • 

< < < •»—••—•♦—* 1 
H H H 1 < 
O U U • 1 

«C < <. 1 H H H 1 H 

< < < 1 1 

< < < 1 1 

CD CD CD 1 U Ul U 1 til 

O 

■ H H H ' 1 <n— 
H H H • 1 
H H H 1 U. lb Uu 1 U. 
CD CD O t ' 
H H H ' t 

< < < ■ S s s • n 
pop 1 

H H H « ' 

< < < 1 S £ 2 1 S 
CD CD CD ' 1 

O 

- < < < t t CO— 

< < < 1 ^ ic 1 ^ « 
0 u 0 * • 

< < < • • 

0 0 0 ■ X X X l X 
H H H • > 
H H H • " 


i% )k : u ■> i> ; i> 
5 S : ^ * * ! « 

H H r | 

* < < 1 H 1 M 
-•HO* 1 
C < . 

D CD CD 1 Q O Q , 0 
"* H H 1 1 
3 CD CD 1 I 
0 CD CD 1 CD CD O 1 CD 

-HO* , 
J CD CD < I 
^ H H , O O O J O 

- H H ■ 1 

J> CD CD 1 > > > 1 > 

- H H 1 ( 
<J CJ Cj i 1 

J O O » Qi CL. CL 1 Cu 

< CD CD « 1 

^ H H 1 J J J 1 J 
CD < < * 1 
U U U 1 1 
U O O 1 a. a. cl 1 Cl 
CD CD O 1 1 

< < < 1 1 

O O O • O O O 1 O 
H H H 1 1 
U U U 1 t 

U U O 1 cl a. a. 1 cl 
u 0 O j 1 

H H H 1 U. U. U. ! 

CD CD CD 1 1 

*C < < i 1 

O CD O ( to oa Ct3 l U 

H H H 1 1 

H H H • 1 

H H H 1 U. U. U. 1 Iju 

O O O • 1 — 

< < < 1 1 

H H H ■>->•>* 1 *x . 
CD CD CD i 1 *♦ 
H H H > 1 J 

< < < 1 2 E 2 1 E ! 
U O O 1 1 < 
H H H 1 1 ( 

CD CD CD ' • ** 

<. < < t ' c 

0 u O t 1 

< < < 1 1 

CD CD CD 1 Q O O iQ f 

< CD < ♦ t > 

< < < 1 1 - 
OOO • U U U lb] L 
O O O * ' 
OOO 1 ' 

O O O ' cs c; cs 1 os 

< O < • " 
OOO' ' 
OOO 1 OS QS OS I OS 

< 0 < « « 

OOO" 1 

< < < 1 H H H 1 H 
OOO' « 

0 0 0 1 1 

O U U 1 CL CU CL 1 CL 

- < < < • 1 

O CD O ' 1 

0 0 0 « 0 0 0 » 0 

*£ < < 1 1 

0 U O • 1 
HHH ' WWW 1 W 

< < < ■ 1 

< *C < 12 2 2 • 2 
CJ CJ CJ 1 1 

- O U CJ ' ' 

E- H H ' WWW l W 

0001 » 
U O O * • 
HHH • W W W 1 W 
HHH" ' 

< *Z < * ' 


"N. 
f-< 

00 
CN 

tr 

tut 

*M 

(U 
J-l 

VO 
< 

u 


6 C6.1. human 
3 C6.1. mouse 

37 C6.1.dog 

IB nt.comp 

14 human. pep 
16 mouse. pep 
39 dog. pep 

3 8 pep.comp 

15 TS10q23.3.pep 


6 C6.1. human 
3 C6.1. mouse 

37 CG.l.dog 

38 nt.comp 
44 human. pep 
16 mouse. pep 

39 dog. pep 
18 pep.comp 

15 TS10q23.3.pep 


6 C6.1. human 
3 C6.1. mouse 

37 C6.1.dog 

38 nt . comp 

14 human. pep 
16 mouse. pep 

39 dog. pep 
18 pep.comp 

1 5 TS10q23.3.pep 


6 C6.1 .human 
3 C6.1. mouse 
J7 C6.1.dog 

38 nt.comp 
44 human. pep 
16 mouse. pep 

39 dog. pep 
48 pep.comp 

15 TS10a23.3.oep 



BNSDOCID: <WO 



9S33907A1 I > 



WO 98/33907 



PCT7US98/00353 



14/27 




CTG 
CTG 
CTG 


i — » t— » L_l t 




o o O i 

3 5 2! 


Dl.CX.0l, i 




o o O i 

fr* fr* H i 


U LU tU i 




*C ** *r t 

3 3 5 

M fl ( j 


2 2 2 i 


2 








o u 5 I 


Q O Q 1 


Q 


E- fr- H 1 






u O O i 






*t *C *t i 

t: t • 


CO CO CO i 


CO 


-* E-* 1 






O O U i 
fr" E-» H i 


> > > 1 


> 


<C | 






O (5 C3 i 
<t *c i 


Q Q Q i 


o 


(J (J CJ 1 






O O O i 


a* a, a, ( 




< f-« o * 






CJ U U i 






*5 < *c i 
< o o * 


t- e t- 




^ H H i 






3 p O t 
r» £« H i 




> 


U U (J ■ 






E-« E-« E-» i 


to CO CO 


CO 


E- 1 H i 


CJ CJ CJ i 






< < < ! 


E-> H E- 


E^ 


< < < i 






O U CJ i 






- e- e— i 

C- E- « 


w w « 


to 


O O O i 






< < < 1 


CO CO CO 


1 cn 


O O U i 






e> o o i 






*C < < l 


to CO CO 


I CO 


r- t- E- t 


J CJ CJ I 






o o e> i 


< < < 


i < 


e> o o » 






< < < i 






e> o o i 


u u to 


• w 


< < o * 




o u cj < 






o o U t 


o. a ul. 


1 cu 


H t- CJ * 






< < < 1 






*C < < i 


~ 2: z: 


i 2 


< < < 1 






u u u • 






fc- t- E- i 


to CO to 


■ CO 


CD < < • 


U U (J i 






u u o 


a. a a 


i a* 


o o o 






< < < 






o e> u 


u u u 


I w 


O 15 O 




< < < 






cs o o 


u u u 


i w 


< < < 












o o o 


> > > 


i > 



< < 

CJ CJ 

33 

< < 

3 3 

U U 

< *c 

U E- 



E- E- E- 

H 6- 

O U CJ 

< < < 

E- 6- E- 

O < O 

f-» E-» H 



E- H 
&«; i£ 
E- E- fe- 
lt, u- u- 



C 

o 
o 

o 
CD 



C 0) 

m w a a 

E 3 Di 0) CD 

^ o o a a a cl 

£ ETJ E • -0) 

• • • O C 0) Q* 

HHi-l U flj (C • 

• • * - E 3 CJ> 
voovc u n O O 

uuu c; j-t E t> 



V£> n ^ CO tj» \D CTv CO LO 
^ ^ ^ 



WO 98/33907 



PCT/US98/00353 



15/27 



s s 

< < 

< < 
u u 

< H 
fr- E- 

< 

u o 

< < 

o u 

< < 

o o o 

(5 O O 

o o o 

I- H H 
1- E- H 
1^ fr- H 

" &y 



5 5 5 
B B 8 
555 

CD U U 

< < < 

o e> o 

< < < 

u u u 
u u u 
e- o 

< < < 
poo 



- H E- 
U U 
<C < 
O U 
U (J 
<■< 



o o o 

H H t- 

u u u 

m 

6* E-« f - 

< < < 
O O O 

< < < 
H E- H 

< < < 
fc- E- E- 

sss 

O U V 
tr* fr* t~> 

< < < 



> > 

o o 
E~ w 

x x 
o o 
a cx 



■ o 
i 



i o 

I 
I 

i a 



til U U i ti] 
I 

Q Q Q 1 O 
i 
I 

U. U. Lu i u. 

■ 

CU Ou CL i CL 
U U U 1 10 

s 2: 2 i e 

U W 03 i til 

CL CL EL I Q. 

I 

Q Q Q i O 
i 

CO W W 1 CI 

o a a i c 

H E- E-* I E- 

t 

H H E- t €- 

a a q i a 

t 
i 

CO CO CO I CO 
i 
t 

>»>-">• I >- 
i 
i 

k cc a: i o: 

t 

i 

> >« > i >, 

i 

x x x i x 



E-* 

E- 

U 

< 

E- 
t- 

t 

o 

< 
u 
< 

u 

E- 
< 

u 
< 

E- 
< 

U 
O 

a 
< 

E- 

E- 

O 

E- 

, U 



< 
< 

CD 
< 
E- 

E- 

E-« 
O 

s 

E- 
< 

< 
(J 
E-« 

8 

O 
E- 



ii ti H n 



O 

< It ti 

O H ti 

CJ V v 



o 

cn 

CD 
u. 



V V V V 



§3 O <u CD 

o o a a a a 

JZ E 'D E • • 0) 

. • • o c <u a 

HHH O (0 (I) * 

• • • • E 3 Dl 

VO VO VO *J a o o 

UUUC£ E'D 



a 
a 



a • 

E ro 

O f"M 

U V 
• o 
p. 1—1 
0) CO 



O O 
E 



ro *f *y m . 



a a 

QJ 0) 

o. tt a a 

e . . o 

■ • • p c a) a 

H *-H *-l O CO .10 * 

* • • ■ E a D> 

\0 \£> VO 3 O O 

u o a c x: e-o 



<" ^> ^ m ^rp r -1 



a 
a 



a • 

E m 

O <M 

o cr 
• o 

Oj r-H 

0) cn 



C (19 

ra to 

E 3 

=3 O 

£ E 



a 

QJ 

a 

q> qj Ou * 

a a a a e rn 
E * • a> o cm 
o c a) a u cr 

O tt) to * - o 

* E 3 D) fi— « 

u 3 o o ocn 

C £ E'D Qi fc- 



jvo nr-^^ux^cDiri 



C OJ 

(0 CO 

E 3 

3 O 



VO VO 



a a 

QJ QJ Qj 



H O 10 01 

• * E 3 
vo *J 3 O 



QJ O fN 
C QJ CL U 

♦ • o 
D) Di«-h 
O QJ CO 



U C£ E'D ae* 



^ ^ ^ ^ ^ ^ — 1 



WO 98/33907 



PCT/US98/00353 



16/27 



V V V V 



o 



TG 






< 






E- 






- 












r- 






A AT A 






< 






























E-* 












O 
E- 






5 
o 






o 






o 


















u 






o 












o 












o 






o 






< 






E- 






f- 












o 






o 






u 






E- 












< 












o 


















u 






o 






E- 






< 






< 












U 












E- 






< 






CJ 






u 






< 












cj 






o 






H 
























(3 






o 






< 






o 






u 






u 






E- 






E- 






t- 






o 






o 


II 


II 


< 


II 


11 




V 


V 



tl II II II II 



< 

E- 
U 

o 
< 

o 
u 

6- 

cj 
< 

B 

u 

e- 
o 
*c 
o 

CJ 



0) 

CL a ro 
Cn 0) 0) a. * 

o o, a a a e m 

*D E • • QJ O (N 

• p c a) a o tr 

H O (0 (/) . . o 

• • E 3 D> {X rH 

vo *j d o o a> co 
vocnr^corrvooNcom 

n ^ rn rp 



C 0) 
0J CO 

r» o 



vo vo 



0) 
CO 

=3 Dl 
O O 
E Tf 



VD VO VD U 



a o< 

cu cy 

a, a, a a 

e . . 0) 

p c o) a 

O 10 10 . 

£ 3 Dl 

3 0 0 



a 
a 



a - 

O CM 

u cr 
• o 

ttJ CO 



o o o q x: 5 x> 



vomr^co^vocncotn 

n ^ -TP m r-» 



C 0) 

row a a* 

3 go £ a a a 

• * * o c a> a 

H H H O (0 (D • 

VOVDVO AJ § O O 

o u o c .c e 



— — ^ 



a 
a 



a • 

O rva 

u cr 
• o 

a> co 
ah 



C CO 

«J co 

E 3 

3 O 

x; e 



VO VD 



a a 

cu cu a 

a a a a e 

E= . • CU O CN) 

o c <u a o a 

O «o w • • o 

• E 3 Di an 

u 3 O O (DO) 

C£ ETJ ah 



<^> TT* C> *3 



WO 98/33907 



PCT/US98/00353 



17/27 



u 
o 
o 
u 



< 

< 

< 
< 
< 

g 

6- 
O 
O 

o 



< 
< 

s 

U 
O 

o 
< 

o 



c 
c 



LL 



C 0J 
to w 

§D 
O 

£ E 



a 
a 



o* a 

Q) 0) 



o, a a a e ^ 

E • • 0) O oo 
O C Q) a u cr 

O ID (0 • * O 

*j 3 o o o co 



n n *; pi *t *— '— 



c o 

(TJ (0 

O O 
s: E T3 



vo vo u 



a a 

E • • <U 

o c <d a 

U (fl U) • 

E 3 CP 

3 o o 



a 
a 



a * 

E ro 

O fN 

o cr 
• o 

Qu f— l 

0) CO 



UUU C X E 'D 

7 r ^ ^ 



C 0) 
(0 w 

li 



a 

QJ 

a 

o> a) a) a • 
o a a a a e ro 

*0 E • • (U 0(N 

• o c a* Oi o cr 

H O (0 til • ■ O 

• ■ E 3 Di Q.i-t 
vo O 3 O O O) t/1 

u e 73 



-cp — — — — 



■ n v 



C 0) 

aJ to Qi Qi _ 

P 3 Di 0) 0) CL 

3 o o cx a a a e ^ 

£ ET3 E * • 0) ON 
. . .OCflJftU " 

H H H O TO W ■ • 

. . • • E 3 0> Qi 

vovovo u 3 O O CD cn 

m yr rn .— « 



RWKnnnin- <wn BS33907A1 i > 



WO 98/33907 



PCT/US98/00353 



18/27 



II II II II 



II II 11 II II It 



II tl II II II 



O li 

< 



H 

O A A 
O li h 
O II ii 
C3 

o 

u 
o 
o 
u 
u 
o 
< 

o 
o 

U 
t- 

u 
o 
o 
o 

u 

< 

o 
u 

u 
o 
u 

CJ 

8 

(J 

o 
< 
o 
o 
< 
o 

H 

o 

U 
(J 

u 

< 
u 

U 
t- 

u 

u 
o 
o 

< 

o 

o 

o 
o 
o 
o 
o 
o 

8 

u 
o 

8' 

e> 
o 

< 

< 

8 

< 



AAA 



H H H M H H M 
XMI-IMMMVHI-I.M 



ooooooooo 
ooooooooo 

OCJOUCJUOUU 

VD VD VD VO VO VO VO VO VO 

<<<<<<<<< 

uuuuuuuuu 

H M n tn T r m 



CQ CJ Q U tl* O X 

HHHHHHH 

r-trH)t-t«HHtHrH<-HH 
OOOOOOOOO 

OOUOOUCJUU 

VOVOVDVOVOVDU)U)VO 

f*. J 5 : < < < <• < < 
uoooouoou 



PQ CJ D W tu O X 
MMMMMMM 
<CQUQWU*OXH 

<<!cnoawu«ox 



ooooooooo 
uuuuuuuuu 

VOVO VO VD \D \£) VO VD 

<<<<<<<<< 

UOUUUOOOU 



HHHHHHH 
<DQUQWU,OXH 

<<CQaQWPu,OX 
XWHMHHHHH 



ooooooooo 
uuuuuuuuu 



vovovovovovovovovo 

<<<<<<<<< 

uuuuuuuuu 



WO 98/33907 



PCT/US98/00353 



19/27 



A A A A 



O II II 11 
U « » I' 



O 



A A A A A A A 
II II II II II tt II 
II II It II It II II 



3 



CJ 

o 
o 

g 

CJ 

o 

U 

u 
o 
< 
o 
o 
o 
u 
o 
e> 
o 

u 

u 

o 

o 
u 
u 



o 



H ra ro 
u * o i 

f- « u I 

O xj I 

U U u 

H O u 

H JJ ro 

o O ffl 

< Di Cn 
O to XJ 

< u u 

U U XJ 

O Oi Oi 

CJ <0 xJ 

CJ xJ <0 

U 10 D 

U U xj 

SID (0 
O) cq 

Oram 
O d> ro 
C3 © 

It- Cn 



Cn 
to 

ra 
ro 





ro 


< 


(0 




(0 


6- 


o 


u 


iJ 


fr- 


XJ 


o 


xj 


e> 


O 


o 


XJ 


o 


(D 


o 


cn 


< 


D* 


u 


xj 


o 


cn 




o 


< 


re 


o 


XJ 


u 


cn 




ro 


S 


XJ 


o 


XJ 


< 


D» 




ra 




XJ 


g 


ra 




CO 


t? 


XJ 


o 


XJ 


o 


ra 


o 


ra 


o 





O XJ (0 
O xj cn a a 
u ra n i] 



U U xj 

O xj xj 

O U xj 
Oreo 

O ra cn 

lr* Cn xj 

CD ■ xj ro 

CJ XJ ro 
U Dl u 

U ra o 

fr- ra cn 

U Cn xj 

U ra ra 

< u «j 

< xj ro 

< ra xj 
H ra u 
H ra ro 

h O U 

« Di 
e> Di xj 
O XJ xj 

u ra o 

O u ra 

rj> o xj 

o cn ra 

O ra xj 

CJ xj en 
U xj Di 

U xj ra 

*C XJ 4J 

O XJ U 
O xj Cn 
cj) u ra 

< U D 

U tn xj 

O xj ro 

fr* xj xj 

< D) U 



U XJ xj 
< Dl U 
CJ L> xj 

o ro ro 

O « ra 

O xj ra 

O en xj 

O xj cn 

O u xj 

O xj xj 

O xj o 

CJ xJ xJ 

O XJ cn 

O xj xj 

U xj (j 

O U TO 

O xj o» 

CJ xj xj 

O U ro 

O xj AJ 

U D> u 

< xj ra 
U cn cn 

' XJ XJ 

- O xj 

cj ra u 
cj u ra 

fr* *J xj 
CJ xj cn 
CJ xj ra 

U xj cn 

ra ra 

O D> ra 

< ra xj 
h O u 
U U xJ 
H xj xj 

t- XJ XJ 

u o ra 
cj ra xj 

U D» xj 

O xj cn 

h U D> 

O xj Di 



c 
o 
o 



CQ O Q W U« O X 

M M l-H M M M IH 

<CJUQU(uO JCH 
MXXXXXXXX 



ooooooooo 
OOOOOOOOO 

uuuuuuuuu 

VO VO VD VO VO VD VO VO VD 

<<<<<<<<< 

UUUUUUUUU 



ffl U Q U O I 



<fflUQtdtnUlH 
n X X X X X X X X 
<<(QUOW^OI 



ooooooooo 
ooooooooo 

VO VO VD VO V£> VO VO VO VO 

<<<<<<<<< 

ooooooooo 



ffl U D W ^ U I 

M M W H M M t-l 
MXXXXXXXX 



ooooooooo 
ooooooooo 
ooooooooo 

VOVOVOVOVOVDVOVOVO 

<<<<<<<<< 
OOOOOOOOO 

H fM n in LP r. m 



CQ O Q W tn O X 

HHHHHHH 

<CQODWtJu03CM 
WXXXXXXXX 

XHHHHHHI- 



ooooooooo 
OOOOOOOOO 
OOOOOOOOO 



vo vo vo vo vo vo vo vo vo 
<<<<<<<<< 

ooooooooo 



BNSDOCI0: <WO 9833907 A 1 [ > 



WO 98/33907 



PCT/US98/00353 



20/27 



Jj ffl Ol JJ 

JJ D> J-> A tO 

U ID iJ II 

o» u ti 

(D JJ 

D> jj 



CD jj o 

CJ Ol ffl 
U U u 
U U JJ 
H " 
CJ 
CD 
CJ 
CJ 



ffl JJ 
ffl JJ 



JJ U 



Ol J-> 

O 

Cn id 
JJ to 
Ol f0 



s 

8 

O 
CJ 

CJ U Oi 

€-* JJ jj 

H U U 

U DU 

fr* 10 U 

cd jj jj 

u JJ JJ 

H U (0 

H O *J 

< co to 
fr* JJ ffl 
£■« Oi ffl 

< ffl 01 

O H3 O 

CD ffl JJ 

< JJ to 
O co jj 
O U jj 

CJ ffl ffl 

h Di D 

fc-« JJ ffl 

CD JJ ffl 

U JJ o» 

CJ JJ ffl 

CD JJ JJ 

" D> ffl 
D) Cn 



JJ 10 

jj to 



< 

CJ 

cd 
cd 

< JJ O 

cd jj jj 

cd jj jj 

CJ ffl CP 

o o> Ol 

CD JJ ffl 

O JJ JJ 

O *JJ JJ 

CD u o> 

O Oi jj 

U JJ ffl 

O JJ JJ 

CD JJ JJ 

< JJ ffl 

CD Cn U 

CD Cn jJ 

O O) U 

CJ o> JJ 

h u s- 

u 

CD JJ JJ 
O to jj 
U to ffl 

CJ U JJ 
U U ID 

CD Cn ffl 

CD JJ 



CD 



. Cn u 

CD Cn u 

' re jj 

v Oi 

C jj 

o> to 



CJ ffl U 

CJ W JJ 

< JJ Oi 

CJ « ffl 



V 
JJ 
CD 



ffl 



ffl 
u 



Cn 
cn 



u o 



< 


u 


JJ 


Ol JJ 


ffl JJ 




to 


ffl 


JJ ffl 


ffl u 


< 


JJ 


JJ 


ffl CJ 


JJ O) 


o 


ffl 


JJ 


U ffl 


JJ JJ 


CJ 


(0 


JJ 


ffl JJ 


ffl u 


H. 


(0 


to 






CD 


JJ 


JJ 


Jj 


Oi ffl 


CD 


ffl 


JJ 


JJ 


ffl Ol 


CJ 


u 


ffl 


ffl 


ffl JJ 


£ 


to 


Cn 


JJ 


JJ JJ 


fr* 


Jj 


Cn 


JJ 


JJ o> 


CJ 


to 


JJ 


Oi 


Jj ffl 


CD 


Jj 


JJ 


JJ 


ffl u 


CD 


Jj 


Oi 


JJ 


Cn jj 


CJ 


Cn 


JJ 


u 


Cn jj 


u 


O) 


JJ 


XJ 


Jj Jj 


CJ 


JJ 


JJ 


Jj 


CJ ffl 




ffl 


ffl 


ffl 


Cn JJ 


fr> 


to 


JJ 


ffl 


ffl Jj 




(0 


Ol 


Oi 


o» JJ 


*C 


XJ 


o» 


ffl 


JJ ffl 


CJ 


to 


Ol 


Jj 


JJ u 


CJ 


u 


Oi 


ffl 


ffl JJ 




ffl 


to 


ffl 


JJ JJ 


<- 


u 


to 


Jj 


JJ JJ 


CJ 


to 


Ol 


Jj 


ffl ffl 


CD 


to 


ID 


u 


Jj JJ 




JJ 


JJ 


ffl 


JJ JJ 


CJ 


JJ 


to 


JJ 


Jj Jj 


CD 


ffl 


u 


CJ 


CO JJ 


H 


JJ 


u 


JJ 


JJ JJ 


CJ 


Oi 


o 


ffl 


JJ ffl 


CD 


CO 


ffl 


Jj * 


U jJ 


CJ 


CJ 


ffl 


O) 


JJ ffl 


CJ 


u 


to 


to 


u 


CD 


to 


JJ 


CJ 


Oi 


*c 


JJ 


CJ 


10 


JJ 


CJ 


Jj 


Jj 


ffl 


JJ 




JJ 


u 


JJ 


'ffl 




CJ 


to 


re 


JJ 


CJ 




Jj 


Ol 


ffl 


o 


to 


CJ 


Jj 


Cn 


H 


JJ 


to 


Ol 


JJ 








Oi 


JJ 


CJ 


JJ 


JJ 


Ol 


ffl 


CJ 


JJ 


ffl 


Ol 


JJ 




CJ 


Cn 


Oi 


Oi 




u 




JJ 


c 


CJ 


ffl 


ffl 


ffl 


JJ 


CD 


JJ 


u 


JJ 






u 


ffl 


to 


(J 


CJ 


Ol 


Ol 


ffl 


o 


8 


ffl 


JJ 


JJ 


JJ 




ffl 


ffl 


ffl 


ffl 




o» 


re 


JJ 


JJ 


CD 


o» 


Oi 


JJ 


ffl 


< 


JJ 


ffl 


ffl 


o> 


CJ 


Cn 


ffl 


JJ 


JJ 


CJ 


JJ 


JJ 


JJ 


o 


U 


ffl 


JJ 


ffl 


JJ 


CD 


u 


JJ 


1_> 


u 


CD 


CJ 


ffl 


JJ 


ffl 


< 


JJ 


JJ 


JJ 


JJ 


CJ 


Ol 


JJ 


JJ 


JJ 


CD 


JJ 


to 


cn 


Oi 




u 


to 


o> 


ffl 


< 


10 


JJ 


JJ 


Oi 


CD 


JJ 


to 


JJ 


ffl 




CO 


to 


JJ 


Oi 


1 


(0 


JJ 


o 


cn 




Ol 


Ol 


o» 


ffl 


< 


Jul 


JJ 


ffl 


Oi 




u 


ffl 


ffl 


cn 




o 


re 


JJ 


Oi 




ffl 


JJ 


JJ 


ffl 


CD 


JJ 


Oi 


JJ 


ffl 




to 


JJ 


JJ 


JJ 


CJ 


JJ 


Oi 


o 


JJ 


CD 


ffl 


JJ 


JJ 


JJ 


CD 


u 


o 


ffl 


JJ 


6- 


JJ 


JJ 


re 


Ol 


CJ 


u 


re 


JJ 


JJ 


fr* 


JJ 


to 


JJ 


JJ 


CJ 


JJ 


ffl 


cj 


ffl 


CJ 


JJ 


JJ 


ffl 


ffl 


CD 


JJ 


re 


CJ 


JJ 


a 


JJ 


JJ 


ffl 


to 




Ol 


O) 


Jj 


Oi 


CJ 


JJ 


JJ 


re 


JJ 


CJ 


o 


JJ 


u 


ffl 



<_>jJCJfflJJfOfflff)JJ 
l-f ffljjjjjj cn nj jj re 
Cn O U ffl cn ffl ffl cn 
.J JJ Cn o ffl jj jj «j to 
CDJJffljJfflffljjrere 
*c jj jj jj re ffl cn jj jj 



Ol ffl 
ffl ffl 
U JJ 



CJ Oi JJ (J ffl JJ 
tO Ol JJ JJ ffl 



jj jj o> JJ 

Jj JJ JJ JJ 



Oi 
O) 
Cn 



2 



m " 



ffl JJ 

0 ffl 

ffl re 

01 JJ 

co jj to re jj jj 



ffl jj ffl u 



jj jj 

U jj jj 
Ol 



jj jj re ffl 



ffl jj « in 

JJ JJ CJ ffl 

re oi jj re 



re JJ u 

CJjJretouoitoto'ffl 
ooiiauiDuoimu 
CJffljJcnjJOijJtojj 
CDreoifflurejjfOjj 

*C ffljjjj U JJ jj jj jj 
<COfflJJffliJiJff}(0 
ffl CO ffl JJ XJ u CO JJ 

jjjjjjjjfaoiroffl 
jjjJCncnfflUffljj 
jJfflfflfflCJjJfflro 

JJjJjJfOOlfflfflffl 
OiDijJffl ffl ffl ffl ffl 
JJfflOlJJjJfDjJfO 
COJJjJOfflffljJfO 
tO ffl JJ jj U JJ jJ jJ 
JJ Cn JJ JJ (J JJ JJ jj 
CJjJCnjjfDffljJCn 
Cn jj jj ro jj cjjjjj 
recjfflcjjjjjou 

JJJJfOfflOlfflJJjJ 

OJJUfflfflffljJfO 
tOJJOiCJCJjJjJffl 

ffl jj cn cn u jj jj to 
jjjjtaiJUfflnjffl 

tOOiUfflffljJOiffl 
UOiJJJJOlfflfOffl 
JJjjjJOiUCnUJJ 
jjjjffljjrejJtoco 
jjjj cn jj jj u oi to 

JJjJfOJJCJCflxJOi 

CJOfflOlOiJJOlOi 
JJjJfflfflCtiJJOtJJ 
JJ Cn jj ffl jj 



CJ CJ jj _ 
Ol JJ JJ Oi 
ffl ffl ffl jj jj Ol ffl 



ffl JJ JJ CO 

re ffl ffl ffl 
ffl jj U ffl 



Cn ffl u 



o 

U CJ 
ffl JJ 



OlOiJJjJjJjJjjjj 

cn ffl cn cn cnjj o>to 

OiffljJffljJjjjj (j 
JJfflOifflfflfflffljJ 
Oi to jj jj jj jj rou 
fOfflrojjcofflrecn 
jJCnjjjjreCJUj.) 
tQOiffljJjJCJCnffl 
JJfflfflUffljJjjjj 
UjJjJjjjjfOJJffl 



CD D) JJ ID U Ol ffl JJ CO 
CD tore u j jj cn jj jj 
CD CJ Oi Oi Cn ffl ffl jJ xJ 
CD CJfflrOJJfflOOffl 
<JJO)UOiJJf0f0ffl 
CDCJjJfflfflUJJOlffl 
OffljJUUJJffljJro 
CDJJjJfflUJJCnjJffl 
< JJ ffl ffl JJ jj ffl OiJJ 
OrOJJfOfflmUJJjJ 

*£ jj Oi jj ra ffl cn jj u 
CJ jj Cn jj Cn ffl jj jj oi 
CD JJ JJ 0> Ol U JJ JJ jj 
CDUffljJJJjJrOtOOi 
CJCJfOnOOiOiJJffl 
CDUfflffljJOifflJJU 
CDtOJJjJCJfflffliJffl 
CJUJJJJjJtOUOlJJ 
CDCJUffljJjJjJfOjj 
<. fflfflfflJJjJffljJOl 



< Ol CJ jj jj u 

CJ jj jj re jj re 

CJ JJ jj ffl ffl u 

CD ffl jj ffl jj jj 



Oi ffl JJ 

tO Oi JJ 

Oi JJ jj 

JJ ffl Oi 



CJ CJ < 
fr* U CD 
CJ < < 




CD ffl JJ jJ u O jJ JJ CJ 

jj jj CJ jj jj 

u 
< 

CJ 
CJ 
CD 
< 

a 



u jj 

re jj ffl jj jj re jj 

ffl JJ CO ffl jj jj jj 

JJ jj jj jj jj Ol JJ 

JJ cn jj jj jj jj jj 

CJ JJ jj cn jj re jj. 

U jj jj cn jj jj jj 

jj jj cj re cj oi cj 



E- jj cn ffl cn jj 

o jj jj o jj cj 

f- u JJ JJ u JJ 

k-* jj jj re jj jj 

H JJ JJ JJ JJ u 

fr* jj jj re re Ol 

ffl cj jj jj cn 

CJ JJ Ol JJ JJ 4J 

O tO Ol JJ CJ JJ 

E" 1 cn jj u jj jj 



CJ to cn re jj to 

cj u oi re jj re 

E -1 jJ JJ jj jj (j 

CJ JJ tO ffl jj jj 

f- JJ ffl CD JJ jj 

CJ ffl JJ ffl JJ xj 

f- JJ JJ ffl jj jj 

Offl Cn ffl u jj 

fr- 1 CJ jj ffl cn jj 

< Oi JJ u o» JJ 



ffl JJ JJ 
JJ JJ JJ 



ffl u 

Oi JJ 

JJ JJ o< 

Oi JJ 



Jj JJ Jj JJ 
JJ Jj ffl 

jj re cj 



CJjJjJ CnCnjJ ffl Ujj 
CJUJJJJfflfflUjJ 

OoiJJOirecjoioi . 
Fr-JJfOfflUfflCnjJCn 



bud Wfcoi 

H H H H H H H 
<CQUQU[i,OIH 

XMMMMMMM1-H 



OOOOOOOOO 

uauuuuuuu 

V£> VO VD KO \0 VjO VO \0 VO 
<<<<<<<<< 

uuoouoooo 



CQ O Q W U. O X 

^HHHHHHHH 



OOOOOOOOO 

uouuououo 

y) \o vo vo \o vo vo y! vo 

uuuuuuuuu 



M M IH HH M M M 



OOOOOOOOO 

uuuuuuuuu 

vovovovovovovocovo 
UUUUUUOUO 

I h n n <r tn vo op ns_._ 



H HH IH H H H W 

<<WUQWDuOS 



OOOOOOOOO 

OOOOOOOOO 
UUUUUUUUU 



VOVOVOVOVOVOVDVOvO 

<<<<<<<< " 
UUUUUUUU 

T-irom^Ktnvpr^co 



BNSDOC1D: <WO 9833907 A 1 



WO 98/33907 



PCT/US98/00353 



21/27 




WO 98/33907 



PCT/US98/00353 



22/27 



4J U 

D) cn 

xj u 

CO JJ 

iJ CO 

xj to 

xJ (0 

XJ (0 

XJ XJ 

(0 x> 



cn x> 
xj cn 

IQ 10 



Cfl x> 
B cn 

XJ XJ 

cn u 
xj <o 

(0 (0 
Cfl XJ 



u cn 
xj cn 



co o 



tT> CO 
»(D (0 
»U (0 



xj cn 

XJ XI 

a u u 

Cn (0 CO 

Oi XJ CO 

XJ U XJ 

Cfl Cn to 

to co xj 

Cn Cn <o 

co co cn 

xj xJ Cn ! 

u u m 

(J xj co 

xj U CO 

co XJ Cfl 

co cn xj 

CO xJ CO 

t0 CT XJ 

xj xj cn 

u o cn 

O O CO 

Cn xj co 

it Cn co co 

II co XJ co 

v xj xi xj 









CO 






Cn 


Cn 






Ca 


nj 




U 


en 


cn 




CO 


XJ 


tn 














CO 


Cn 






Oi 


CO 








*o 




CO 


CO 


<0 




O 


<0 


*0 






CO 


<0 




O 


XJ 


XJ 




to 


U 


CO 




Cn 


Cn 


XJ 




XJ 


Cn 


XJ 




Cfl 


en 


XJ 




*0 


Cfl 


XJ 




XJ 


D) 


Cn 




Cfl 


CO 


A3 




XJ 


Cn 


O 




CO 


Cn 


O 




XJ 


Cn 


XJ 




XJ 


U 


XJ 




u 


D> 


CO 




XJ 


XJ 


XJ 




Cn 


Cn 


XJ 




Cn 


Cn 


XJ 




XJ 


CO 


XJ 




o> 


to 


Cn 




XJ 


<o 


Li 




O 


o 


(J 




XJ 


cn 


U 




XJ 


cn 


XJ 




Cn 


cn 


XJ 




XJ 


Cn 


XJ 




O 


XJ 


U 




XJ 


O 


CO 




CO 




CJ 




o» 


CO 


o> 




*o 


CO 


cn 




Cfl 


co 


xJ 






Cfl 


CO 




CO 


XJ 


cn 




CO 


O 


co 




CO 


Cn 


<0 




en 


XJ 


c0 












XJ 


XJ 


CO 




Cfl 


Cn 


CO 




Cn 


XJ 


XJ 


u 


to 


XJ 


XJ 


u 


XJ 


Cfl 


<0 


u 


co 


XI 


cn 


XJ 


XJ 


XJ 


Cn 


cn 


O 


XJ 


TO 


XJ 


XJ 


Li 


XJ 


U 


D» 


XJ 


XJ 


O 


Cn 


XJ 


to 


CO 


XJ 


CO 


(0 


CO 


XJ 


CO 


(J 


Cfl 


XJ 


XJ 


XJ 


CO 


XJ 


u 


XJ 


CO 


XJ 


o 


ca 


Cn 


CO 


XJ 


CO 


Cn 


CO 


XJ 


CO 


XJ 


CO 


XJ 


CO 


to 


CD 


XJ 


XJ 


XJ 


(0 


O 


XJ 


Cn 


XJ 


U 


XJ 


XJ 


XJ 


XJ 


CO 


<0 


XJ 


XJ 


u 


Cn 


XJ 


JJ 


CO 


to 


CO 


XJ 


Cfl 


XJ 


XJ 


XJ 


Cn 


u 


(0 


CO 


CO 


CO 


ca 


O 


(0 


Cn 


u 


Cfl 


to 


XJ 


u 


XJ 


«0 


O 


CO 


U 


u 


JJ 


Cn 


Cn 


XJ 


XJ 


cn 


f0 


cn 


XJ 


Cn 


CO 


XJ 


CO 


XJ 


CO 


cn 


CO 


CO 


CO 


XJ 


u 


U 


CO 


XJ 


O 


Cfl 


CO 


XJ 


O 


CO 


u 


XJ 


Cn 


XJ 


XJ 


XJ 


co 


XJ 


Cn 


U 



PQ U Q ttl [x, o X 
<<<!fflUD WtuOK 



ooooooooo 
OOOOOOOOO 

uuuoooouo 
ooouoouuu 



ii u cn ti 
II co co n 



o o 
tn co 



t0 xj 
xi to 
xj cn 



xj cn 

XJ XJ 

CD jj 



co Cn 
cn co 
CA cn 



D) o> 

xj co 

CO (0 

co cn 

Cn xj 

cn U 

Cn co 

xj co 

JJ O 

JJ U 



» cn 
CO xJ 
U U 



cn cn 
cn xj 
cn u 



CD XJ 

cn u 
co u 



co u 

Cn xj 

co u 

co to 

co co 

Cn Cn 

Cfl cn 

to co 

cn xj 

co cn 

Cfl u 

cn to 

Cn xj 



to xj 

co en 

Ot xj 

D> cn 

CO XJ 

xj co 

Cn cn 

to xj 

« u 

Cn co 

it ii xj cn 

II ll co co 

V v xJ O 



CO 

c 
to 



Cn 
U 
Cn 



co co 

xj cn 

o cn 

U CO 

xj CO 

xj Cn 

xj cn 

CO to 

CO CO 

XJ to 

to to 

U xj 

Cn Cfl 

xj Cfl 

xJ to 

XJ to 



<0 CO 
CO Cfl 



cn ca 

<0 CO 

L> xJ 

o c 

xj to 

to Cfl 

CO Cn 

xj cn 

U XJ 

Cn xj 

Cn u 

to to 

co Cn 

U xj 

II O XJ 



V XJ XJ V 



ooooooooo 
ooooooooo 
uuuouoouu 

u>vovovovovou>vovo 
f\ ?*i fx < < < < < < 

























o> 






Cn 






CO 






CO 






co 






D> 






U 






U 






XJ 






XJ 






XJ 






tn 






Cfl 






to 






Cn 






m 






Cfl 






CO 






CO 






x> 






U 






Cn 






Cfl 






JJ 






XJ 






Cn 






CO 






O 






Cn 






XJ 






XJ 






XJ 






Oi 






to 






to 






XJ 






CO 






o» 






CO 






o 






to 






CO 






u 






Cfl 






Cn 






XJ 






Cn 






XJ 






XJ 






O 






to 






u 






cn 






XJ 






xj 






XJ 






U 






O 






u 






u 






to 






• XJ 


II 


II II 


O 


II 


II II 


u 


V 


V V 


XJ 



►H M W M M M M 

<<CQUQU3tL < 3S 
KHHHHHHMH 

rH«-4<HtHi-l*Hi-HiHi-H 
OOOOOOOOO 

ooooooooo 
uuuuuouuu 

V£> V£) ID VO VO VO vo 



4 



WO 98/33907 



PCTAJS98/00353 



23/27 



r7» 



o 



=7* 



.00 
-4.00 
-5.00 



FIG. 11A 



Hydrophilicity Window Size = 7 



Scale = Kyte-Doolittle 




400 



o 

O 
<Q 

1- 

to 



FIG. 11B 




400 



1.25 
1.20 
1.15 H- 
1.10 
1.05 
1.00 -to' 
0.95 -f 
0.90 
0.85 
0.80 
0.75 



FIG. 11C 




50 



100 



150 



200 



250 



300 



350 



400 



x 



cz 



1.00 
0.80 
0.60 
0.40 
0.20 
0.00 
-0.20 
-0.40 
-0.60 
-0.80 
-1.00 



FIG. 11D 







































tTiiVn 










itjiy 













































50 



100 



150 



200 



250 



300 



350 



400 



RNsnncinr <wo 9333907 a 1 I > 



WO 98/33907 



PCTVUS98/00353 



24/27 



X 
o 



£ 



CO 
o 



O- 

£ 



1.40 
1.20 
1.00 
0.80 
0.60 
0.40 
0.20 
0.00 
-0.20 
-0.40 
-0.60 
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Amphiphilicity Window Size = 1 1 
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FIG. 12A 

Hydrophilicity Window Size = 7 Scale = Kyte-Doolittle 
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FIG. 12D 

Hydrophilicity Window Size = 7 Scale = Kyte-Doolittle 
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